[Front-page figure: nucleotide sequence listing; not legible in the source]

(57) Abstract: The present invention relates to novel 
methods of producing transgenic avians, preferably 
chickens, wherein the incorporated transgene may 
be expressed as a constituent protein of the white 
of a hard-shell egg. The present invention provides 
sperm-mediated transfer for the introduction to an avian 
egg of a transgene encoding a heterologous polypeptide. 
The avian sperm may be irradiated before the transgenic 
gene is incorporated therein. Transgenic genes may 
be incorporated into avian sperm by lipofection, 
electroporation, restriction enzyme mediated integration 
(REMI) or similar methods. The modified avian 
sperm may then be delivered to an avian oocyte by 
microinjection, intracytoplasmic sperm injection (ICSI) 
or artificial insemination, or by natural coitus after 
the modified avian sperm are returned to a male bird. 
Heterologous nucleic acid may be integrated directly 
into the genomic nucleic acid of the oocyte or after first 
integrating the heterologous nucleic acid into the nucleic 
acid of a male germ cell and subsequent delivery of the 
transgenic male germ cell to an oocyte. Alternatively, 
the heterologous nucleic acid may be an episome within 
the sperm, or within the derivative zygote formed by 
the fusion of the sperm and the recipient oocyte, and 
may replicate independently of the zygote genome. 
Co-segregation of the episome with the replicated oocyte 
genome into all of the daughter cells may be induced by 
the heterologous nucleic acid having a centromeric body 
derived from, for example, a chromosome of a chicken. 

PRODUCTION OF TRANSGENIC AVIANS USING SPERM-MEDIATED TRANSFECTION

This application claims the benefit of United States Provisional Patent Application 
Serial No. 60/323,961, filed September 21, 2001, and United States Provisional Patent 
Application Serial No. 60/324,001, filed September 21, 2001, both of which are 
incorporated by reference herein in their entireties. 

1. FIELD OF THE INVENTION 

The present invention relates to methods of producing a transgenic avian by
introducing a nucleic acid encoding a heterologous protein into the genome of an avian
oocyte by sperm-mediated transfection. The present invention further relates generally to a
transgenic avian capable of expressing a heterologous polypeptide, which preferably is
deposited into the white of an avian egg, said avian generated by sperm-mediated
transgenesis. The invention further provides vectors containing coding sequences for
heterologous proteins, the expression of which is under the control of a promoter and other
regulatory elements that cause expression of the heterologous protein and, preferably, lead to
deposition of the protein in the avian egg. Also included in the invention are avian eggs
derived from the transgenic avians and the heterologous proteins isolated therefrom.

2. BACKGROUND 
The field of transgenics was initially developed to understand the action of a single
gene in the context of the whole animal and the phenomena of gene activation, expression,
and interaction. The technology has also been used to produce models for various diseases
in humans and other animals and is amongst the most powerful tools available for the study
of genetics and the understanding of genetic mechanisms and function. From an economic
perspective, however, the use of transgenic technology for the production of specific
proteins or other substances of pharmaceutical interest offers significant advantages over
more conventional methods of protein production by gene expression. (Gordon et al., 1987,
Biotechnology 5: 1183-1187; Wilmut et al., 1990, Theriogenology 33: 113-123).

In particular, the production of monoclonal antibodies by traditional methods is
labor-intensive and costly. The purification of monoclonal antibodies from serum is a slow
and low-yielding process. The use of hybridoma cell lines, which were developed by fusing
a B lymphocyte with a myeloma cell to propagate indefinitely in vivo as ascites or in vitro
in tissue culture, requires major expenditures in tissue culture facilities or mouse breeding.
(Kohler and Milstein, 1975, Nature 256: 495-497). Although various strategies have been
proposed to overcome the deficiencies in antibody yield (e.g., engineering single-chain
antibodies (scAb) comprising immunoglobulin heavy and light chain variable regions), no
method has been proven entirely satisfactory in elevating antibody yields to the levels
desired for adequate commercial production.

The industry has been experimenting with transgenic animals that can express, for
example, an exogenous protein such as an antibody under conditions that offer high yield of
the protein in an active form while incorporating post-translational modifications, such as
glycosylation, typically required for full functionality of the antibody. In this context,
heterologous nucleic acids have been engineered so that an expressed protein may be joined
to a protein or peptide that will allow secretion of the transgenic expression product into
milk or urine, from which the protein may then be recovered. These procedures have had
limited success, however, and may require lactating animals, with the attendant costs of
maintaining individual animals or herds of large species, including cows, sheep or goats.

Avian Transgenics 

One transgenic system that holds potential is the avian reproductive system. The
exogenous protein can be produced in the white of an avian egg from which it may be
readily purified. (MacArthur, PCT Publication WO 97/47739). The production of an avian
egg begins with formation of a large yolk in the ovary of the hen. The unfertilized oocyte or
ovum is positioned on top of the yolk sac. After ovulation, the ovum passes into the
infundibulum of the oviduct where it is fertilized, if sperm are present, and then moves into
the magnum of the oviduct, lined with tubular gland cells. These cells secrete the egg-white
proteins, including ovalbumin, lysozyme, ovomucoid, ovotransferrin, conalbumin, and
ovomucin, into the lumen of the magnum where they are deposited onto the avian embryo
and yolk.

The hen oviduct, for example, can serve as an excellent protein bioreactor because
of the high levels of protein production, the promise of proper folding and post-translational
modification of the target protein, the ease of product recovery, and the shorter
developmental period of chickens compared to other potential animal species. The
economic advantage of breeding flocks of transgenic birds laying eggs expressing
exogenous proteins would be significant when compared to more traditional animals, such
as cows, sheep or goats, producing heterologous protein in milk. What is needed, however,
is an efficient method of introducing a heterologous nucleic acid into a recipient avian
embryonic cell.

Vectors

Genetic information has been transferred to avian embryos using vectors.
Bosselman et al., in U.S. Patent No. 5,162,215, describe a method for introducing a
replication-defective retroviral vector into a pluripotent stem cell of an unincubated chick
embryo, and further describe chimeric chickens whose cells express a heterologous vector
nucleic acid sequence. However, the percentage of G1 transgenic offspring (progeny from
vector-positive male G0 birds) was low and varied between 1% and approximately 8%.
In addition, the use of viral vectors poses limitations, including limitations on transgene size
and potential viral infection of the offspring, thus posing significant regulatory issues for
production of biologics.

Similarly, Jaenisch reported that while retroviral vectors did transfer genetic
information to embryos, the resulting animals were mosaics with gene insertions at various
loci in the genomic nucleic acid. (1976, Proc. Natl. Acad. Sci. USA 73: 1260-1264). The
transgenes were also differentially expressed in the different tissues of each animal.
(Jaenisch, 1980, Cell 19: 181-188).

Nuclear Transfer 

Nuclear transfer from cultured cell populations is another route to produce
transgenics, wherein donor cells may be sexed, optionally genetically modified, and then
selected in culture before their use. The resultant transgenic animal originates from a single
transgenic nucleus and, therefore, mosaics are avoided. Nuclear transfer from cultured
somatic cells also provides a route for directed genetic manipulation of animal species,
including the addition or "knock-in" of genes, and the removal or inactivation or "knock-out"
of genes or their associated control sequences (Polejaeva et al., 2000, Theriogenology
53:117-26).

Two types of recipient cells are commonly used in nuclear transfer procedures:
oocytes arrested at the metaphase of the second meiotic division (MII), which have a
metaphase plate with the chromosomes arranged on the meiotic spindle, and pronuclear
zygotes. In agricultural mammals, however, development does not always occur when
pronuclear zygotes are used, and, therefore, MII-arrested oocytes are the preferred recipient
cells. Enucleated two-cell stage blastomeres of mice have also been used as recipients.

After enucleation and introduction of donor genetic material, the reconstructed
embryo is cultured to the morula or blastocyst stage, and transferred to a recipient animal,
either in vitro or in vivo, and developed to term. (Eyestone and Campbell, 1999, J Reprod
Fertil Suppl 54: 489-97). Double nuclear transfer has been reported in which an activated,
previously transferred nucleus is removed from the host unfertilized egg and transferred
again into an enucleated fertilized embryo. Activation (initiation of development) is most
often induced chemically. Cultured cells can also be frozen and stored indefinitely for
future use.

Although gene targeting techniques combined with nuclear transfer hold tremendous
promise for nutritional and medical applications, current approaches suffer from several
limitations, including long generation times between the founder animal and production
transgenic herds, and extensive husbandry and veterinary costs. It is therefore desirable to
use a system where cultured somatic cells for nuclear transfer are more efficiently
employed.

Sperm-Mediated Transfection Mechanism 

A promising method for producing transgenic animals is the stable transfection of
male germ cells in vitro and their transfer to a recipient oocyte. PCT Publication WO
87/05325 discloses a method of transferring organic and/or inorganic material into sperm or
egg cells by using liposomes. Bachiller et al. used Lipofectin-based liposomes to transfer
DNA into mouse sperm, and provided evidence that the liposome-transfected DNA was
overwhelmingly contained within the sperm's nucleus. (1991, Mol. Reprod. Develop. 30:
194-200). However, no transgenic mice could be produced by this technique.

Similarly, Nakanishi and Iritani used Lipofectin-based liposomes to associate
heterologous DNA with chicken sperm, which were in turn used to artificially inseminate
hens. (1993, Mol. Reprod. Develop. 36: 258-261). Although the heterologous DNA was
detectable in many of the resultant fertilized eggs, there was no evidence of genomic
integration of the heterologous DNA either in the DNA-liposome treated sperm or in the
resultant chicks.

Heterologous DNA may also be transferred into sperm cells by a process called
electroporation that creates temporary, short-lived pores in the cell membrane of living cells
by exposing them to a sequence of brief electrical pulses of high field strength. The pores
allow cells to take up heterologous material such as DNA, while only slightly compromising
cell viability. Gagne et al. disclose the use of electroporation to introduce heterologous
DNA into bovine sperm subsequently used to fertilize ova. (1991, Mol. Reprod. Develop.
29: 6-15). However, there was no evidence of integration of the electroporated DNA either
in the sperm nucleus or in the nucleus of the egg subsequent to fertilization by the sperm.

WO 03/024199    PCT/US02/30156

Yet another method initially developed for integrating heterologous DNA into yeasts
and slime molds, and later adapted to avian sperm, is restriction enzyme mediated
integration (REMI), which utilizes a linear DNA derived from a plasmid DNA by cutting
that plasmid with a restriction enzyme that generates single-stranded cohesive ends.
(Shemesh et al., PCT International Publication WO 99/42569). The linear, cohesive-ended
DNA together with the restriction enzyme used to produce the cohesive ends is then
introduced into the target cells by electroporation or liposome transfection. The restriction
enzyme is then thought to cut the genomic DNA at sites that enable the heterologous DNA
to integrate via its matching cohesive ends. (Schiestl and Petes, 1991, Proc. Natl. Acad.
Sci. USA 88: 7585-7589). Although Shemesh described transgenic birds that were resistant
to Infectious Bursal Disease, there was no evidence of expression of a heterologous
protein in the oviduct or of its deposition into egg whites.

What is needed, therefore, is an efficient method of generating a transgenic avian 
capable of expressing a heterologous protein coded by a transgene, particularly in the 
oviduct for deposition into egg whites. 

3. SUMMARY OF THE INVENTION 

The invention provides methods for the stable introduction by sperm-mediated
transfection of heterologous coding sequences into the genome of an avian, preferably a
chicken, and expressing those heterologous coding sequences to produce desired proteins
and/or to alter the phenotype of the transgenic avian. Synthetic vectors and gene promoters
useful in the methods are also provided by the present invention, as are transgenic avians
that express a heterologous protein and avian eggs, preferably chicken eggs, containing a
heterologous protein. In a preferred embodiment, the vectors useful in methods of the
invention are not eukaryotic viral, more preferably not retroviral, vectors (although the
vectors may contain transcriptional regulatory elements, such as promoters, from eukaryotic
viruses). In other embodiments, however, the vectors are retroviral vectors.

One aspect of the present invention is a method of producing a transgenic avian,
preferably a chicken, by introducing into an avian oocyte at least one transgene encoding at
least one heterologous polypeptide by sperm-mediated transfection. The method comprises
first, isolating an avian sperm, second, incorporating a transgene into the avian sperm, and
third, delivering the modified avian sperm to an avian oocyte. In one embodiment, the
avian sperm is irradiated with gamma rays before the transgene is incorporated therein.

In one embodiment, the transgene is injected directly into the testis of a male avian 
and incorporated in the avian sperm. The modified sperm is then delivered to the avian 
oocyte by mating the male avian with a wild type or transgenic female avian. 

In another embodiment, the transgene is incorporated in the avian sperm in vitro by
lipofection, electroporation, restriction enzyme mediated integration (REMI) or similar
methods. In a preferred embodiment, the modified avian sperm is then delivered to the
avian oocyte by natural coitus after the modified avian sperm are returned to the testis of a
male avian. In another preferred embodiment, the modified avian sperm is delivered to the
avian oocyte by microinjection (e.g., intracytoplasmic sperm injection (ICSI) or standard
artificial fertilization methods). The resulting transgenic embryo can then be transferred to
the oviduct of a recipient hen for development and to be laid as a shelled egg (or,
alternatively, cultured ex vivo). The shelled egg is incubated to hatch a transgenic avian that
has incorporated, preferably integrated into its genome, the selected nucleic acid. In
preferred embodiments, the avian sperm is first irradiated before the transgene is
incorporated therein.

In certain embodiments, a transgene comprising a heterologous nucleic acid may be
integrated directly into the genomic nucleic acid of an avian sperm and subsequently
delivered to an avian oocyte. When the heterologous nucleic acid is directly integrated into
the genome of the avian sperm which then fertilizes an avian oocyte, the resulting
transgenic embryo will include the transgenic heterologous nucleic acid in all of its cells. In
preferred embodiments, the transgenic heterologous nucleic acid is incorporated into at least
one embryonic cell, preferably the germinal disk of an early stage embryo, that then develops
into a transgenic avian.

Alternatively, the heterologous nucleic acid may be an episome within the modified
avian sperm, or within the derivative zygote formed by the fusion of the modified avian
sperm and the avian oocyte. The episome may replicate independently of the zygote
genome. When the heterologous nucleic acid is episomal with respect to the genome of the
transgenic zygote, and the episomal nucleic acid has a centromeric body, most, if not all, of
the cells of the transgenic embryo will include the heterologous nucleic acid. Accordingly,
in preferred embodiments, the transgene further comprises centromere and/or telomere
sequences of an avian chromosome.

The invention further provides a method for incorporating at least one transgene into
the genome of a spermatozoon cell or a precursor thereof isolated from a donor male avian,
and returning the modified cell to the testis of a recipient male avian, preferably the donor
male avian, so that a genetically modified male gamete is produced by the male avian.
Breeding the male avian with a female of its species will generate a transgenic progeny
carrying the at least one transgene in its genome.

The invention also provides methods for introducing a heterologous nucleic acid to
an avian oocyte in addition to those described in United States Application Serial No.
09/877,374, filed June 8, 2001, entitled "Production of Monoclonal Antibody By a
Transgenic Chicken", by Jeffrey C. Rapp; and United States Application Serial No.
________, filed September 18, 2002, entitled "Production of a Transgenic Avian By
Cytoplasmic Injection", by Jeffrey C. Rapp and Leandro Christmann, both of which are
incorporated by reference herein in their entireties. In certain embodiments, the avian
oocyte is removed from the ovaries of a donor female avian to facilitate in vitro fertilization
by the modified avian sperm of the invention. In certain other embodiments, the modified
avian sperm is delivered to an avian oocyte in vivo by natural coitus. The fertilized ovum is
then, preferably, returned to or maintained in the oviduct of the donor female avian or a
surrogate female avian to be laid as a hard-shell egg or, as an alternative, cultured ex vivo.
The hard-shell egg is incubated and hatched, producing a transgenic chick that expresses a
heterologous protein and/or that can be bred to generate a line of transgenic avians
expressing a heterologous protein.

Preferably, the avian sperm or the reproductive system of a male avian, preferably
the seminiferous tubules and/or site of sperm production, development, and/or storage in the
testis, is irradiated by gamma rays before transgene incorporation. More preferably, the
transgene is integrated directly into the genome of the avian sperm. Most preferably, the
transgene further comprises centromere and/or telomere sequences.

In particular embodiments, the level of mosaicism of the transgene (percentage of
cells containing the transgene) in avians hatched from sperm-mediated transfected embryos
(i.e., the G0s) is greater than 5%, 10%, 25%, 50%, 75% or 90%, or is the equivalent of one
copy per one genome, two genomes, five genomes, seven genomes or eight genomes, as
determined by any number of techniques known in the art and described infra. In
additional particular embodiments, the percentage of G0s that transmit the transgene to
progeny (G1s) is greater than 5%, preferably, greater than 10%, 20%, 30%, 40%, and, most
preferably, greater than 50%.

In certain other embodiments, the level of transgenics that result from mating avians
hatched from sperm-mediated transfected embryos (i.e., the G0s) with a wild type or
transgenic avian is greater than 5%, 10%, 25%, 50%, 75% or 90%.

In another embodiment, the present invention provides methods for producing
heterologous proteins in avians. Transgenes are introduced by sperm-mediated transfection
into the genome of an avian oocyte which becomes fertilized and then develops into a
transgenic avian. The heterologous protein(s) of interest may be expressed in the tubular
gland cells of the magnum of the oviduct, secreted into the lumen, and most preferably,
deposited within the egg white onto the egg yolk or expressed, for example, in the serum of
the avian. In preferred embodiments, the level of expression of the heterologous protein in
the egg white of eggs laid by G0 and/or G1 chicks and/or their progeny is greater than 5 µg,
10 µg, 50 µg, 100 µg, 250 µg, 500 µg or 750 µg, more preferably greater than 1 mg, 2 mg, 5
mg, 10 mg, 20 mg, 50 mg, 100 mg, 200 mg, 500 mg, 700 mg, 1 gram, 2 grams, 3 grams, 4
grams or 5 grams.

The transgenic avians can also be bred to identify those avians that carry the
transgene in their germ line. The exogenous gene coding for the heterologous proteins can
therefore be transmitted by sperm-mediated transfection of the exogenous gene into the
avian oocytes, and by subsequent stable transmission of the exogenous gene to the avian's
offspring in a Mendelian fashion. More information on Mendelian inheritance can be found
in Hartl and Jones, 2001, Genetics: Analysis of Genes and Genomes, 5th ed., Jones &
Bartlett Publishers, Inc., the content of which is incorporated by reference herein in its
entirety.

Another aspect of the invention provides for the isolation of heterologous proteins in
transgenic avians and the use thereof in pharmaceutical products including but not limited
to vaccines, biologics and, particularly, therapeutically or diagnostically useful antibodies.
The expressed heterologous protein(s) of interest may be collected and processed using
standard techniques from the avian eggs, preferably the egg white, the serum, or other
tissues from the transgenic avian.

The present invention further provides methods for producing a heterologous protein
in an avian oviduct. The method comprises, as a first step, providing a vector containing a
coding sequence and a promoter that functions in avians, preferably in the avian magnum,
operably linked to the coding sequence, so that the promoter can effect expression of the
nucleic acid in the tubular gland cells of the magnum of an avian oviduct and/or in any other
desired tissue of the avian. In a preferred embodiment, the vector containing the transgene
is not a eukaryotic viral vector (preferably, not a retroviral vector, such as but not limited to
reticuloendotheliosis virus (REV), ALV or MMLV) or derived from a eukaryotic virus (but,
in certain embodiments, may contain promoter and/or other gene expression regulatory
sequences from a eukaryotic virus, such as, but not limited to, a cytomegalovirus promoter).
Next, the vector is introduced into avian sperm in vitro by lipofection, electroporation,
restriction enzyme mediated integration (REMI) or similar methods, or in vivo by directly
injecting into the testis, so that the vector sequence may be incorporated into the avian
sperm. In preferred embodiments, the avian sperm or precursor cells are irradiated by
gamma rays before the vector sequence is incorporated therein. In another preferred
embodiment, the vector sequence further comprises centromere and/or telomere sequences.
Then, the modified avian sperm are delivered to an avian oocyte by natural coitus or in vitro
by microinjection or artificial insemination to form a transgenic embryonic cell. In certain
embodiments, the recipient avian oocyte is wild type and unmodified or, preferably, modified
in a manner that facilitates the delivery of the transgene by the modified avian sperm. In
certain other embodiments, the recipient avian oocyte is derived from a first-generation or,
preferably, second-generation transgenic avian whose germ-line carries the transgene.
Finally, a mature transgenic avian that expresses the exogenous protein in its oviduct is
derived from the transgenic embryonic cell or by breeding a transgenic avian derived from
the transgenic embryonic cell.

The present invention further provides promoters useful for expression of the
heterologous protein in the egg. For example, the transgene may comprise regions of at
least two promoters derived from an avian including, but not limited to, an oviduct-specific
promoter such as ovalbumin, lysozyme, ovomucoid, ovotransferrin, conalbumin, and
ovomucin promoter or any other promoter that directs expression of a gene in an avian,
particularly in a specific tissue of interest, such as the magnum, and a protamine promoter,
or a fragment thereof which is sufficient to drive the expression of a marker gene such as
Green Fluorescent Protein (GFP). Alternatively, the promoter used in the expression vector
may be derived from that of the lysozyme gene that is expressed in both the oviduct and
macrophages. In particular embodiments, the gene regulatory sequences are flanked by
matrix attachment regions (MARs), preferably, but not limited to those associated with the
lysozyme gene in chickens or other avians. The nucleic acid encoding the polypeptide may
be operably linked to a transcription promoter and/or a transcription terminator.

Other embodiments of the invention provide for transgenic avians, such as chickens
or quail, carrying a transgene in the genetic material of their germ-line tissue, preferably
where the transgene was not introduced into the avian genome using a eukaryotic viral
promoter. The transgene incorporated into the genomic DNA of a recipient avian can
encode at least one polypeptide that may be, for example, but is not limited to, a cytokine, a
growth factor, enzyme, structural protein, immunoglobulin, or any other polypeptide of
interest that is capable of being expressed by an avian cell or tissue. Preferably, the
heterologous protein is a mammalian, preferably a human, protein or derived from a
mammalian, or preferably a human, protein (e.g., a derivative or variant thereof). In
particular embodiments, the invention provides heterologous proteins isolated or purified
from an avian tissue, preferably serum, more preferably eggs, most preferably egg whites,
and pharmaceutical compositions comprising such heterologous proteins. In a more
preferred embodiment, the heterologous protein is an antibody that is human (including
antibodies produced from human immunoglobulin sequences in mice or in antibody
libraries or synthetically produced but having variable domain framework regions that are
the same as or homologous to human framework regions) or humanized.

The present invention further relates to nucleic acid vectors (preferably, not derived
from eukaryotic viruses, except, in certain embodiments, for eukaryotic viral promoters and/or
enhancers) and transgenes inserted therein that incorporate multiple polypeptide-encoding
regions, wherein a first polypeptide-encoding region is operatively linked to a
transcription promoter and a second polypeptide-encoding region is operatively linked to an
Internal Ribosome Entry Sequence (IRES). For example, the vector may contain coding
sequences for two different heterologous proteins (e.g., the heavy and light chains of an
immunoglobulin) or the coding sequences for all or a significant part of the genomic
sequence for the gene from which the promoter driving expression of the transgene is
derived, and the heterologous protein desired to be expressed (e.g., a construct containing
the genomic coding sequences, including introns, of the avian lysozyme gene when the avian
lysozyme promoter is used to drive expression of the transgene, an IRES, and the coding
sequence for the heterologous protein desired to be expressed downstream (i.e., 3' on the
RNA transcript) of the IRES). Thus, in certain embodiments, the nucleic acid encoding the
heterologous protein is introduced into the 5' untranslated or 3' untranslated regions of an
endogenous gene, such as but not limited to, ovalbumin, lysozyme, ovomucoid,
ovotransferrin, conalbumin, and ovomucin, with an IRES sequence directing translation of
the heterologous sequence.

Such nucleic acid constructs, when inserted into the genome of an avian and 

25 expressed therein, will generate individual polypeptides that may be post-translationally 
modified, for example, glycosylated or, in certain embodiments, form complexes, such as 
heterodimers with each other in the white of the avian egg. Alternatively, the expressed 
polypeptides may be isolated from an avian egg and combined in vitro, or expressed in a 
non-reproductive tissue such as serum, hi other embodiments, for example, but not limited 

30 to, when expression of both heavy and light chains of an antibody is desired, two separate 
constructs, each containing a coding sequence for one of the heterologous proteins operably 
linked to a promoter (either the same or different promoters) - 9 are introduced into embryonic 
cells by sperm-mediated transfection to generate transgenic avians that harbor both 
transgenes in their genomes and expressing both heterologous proteins are identified. 

Alternatively, two transgenic avians each containing one of the two heterologous proteins
(e.g., one transgenic avian having a transgene encoding the light chain of an antibody and a
second transgenic avian having a transgene encoding the heavy chain of the antibody) can
be bred by Mendelian genetics to obtain an avian containing both transgenes in its germline
and expressing both transgene-encoded proteins, preferably in eggs. (See Hartl and Jones,
2001, Genetics: Analysis of Genes and Genomes, 5th ed., Jones & Bartlett Publishers, Inc.,
the content of which is incorporated by reference herein in its entirety).
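
The breeding route can be illustrated with a short probability calculation. The sketch below is illustrative only and rests on assumptions that are not stated in the specification: each parent is hemizygous for a single-copy transgene, the two insertions lie at unlinked loci, and both segregate in simple Mendelian fashion. The function names and the "T"/"-" allele labels are hypothetical.

```python
from fractions import Fraction
from itertools import product


def gametes(genotype):
    """Map each possible gamete to its probability.

    `genotype` maps a locus name to its two alleles, e.g. ("T", "-") for a
    hemizygous transgene insertion ("T" = transgene present, "-" = absent).
    Assumption: loci are unlinked and segregate independently.
    """
    loci = sorted(genotype)
    weight = Fraction(1, 2 ** len(loci))
    dist = {}
    for alleles in product(*(genotype[locus] for locus in loci)):
        gamete = tuple(zip(loci, alleles))
        dist[gamete] = dist.get(gamete, Fraction(0)) + weight
    return dist


def prob_offspring_has_all(parent_a, parent_b, loci):
    """Probability that an offspring carries a transgene at every listed locus."""
    total = Fraction(0)
    for gamete_a, p_a in gametes(parent_a).items():
        for gamete_b, p_b in gametes(parent_b).items():
            from_a, from_b = dict(gamete_a), dict(gamete_b)
            if all("T" in (from_a[locus], from_b[locus]) for locus in loci):
                total += p_a * p_b
    return total


# Hypothetical parents: one carries the light-chain transgene,
# the other carries the heavy-chain transgene.
light_parent = {"light": ("T", "-"), "heavy": ("-", "-")}
heavy_parent = {"light": ("-", "-"), "heavy": ("T", "-")}

both = prob_offspring_has_all(light_parent, heavy_parent, ["light", "heavy"])
print(both)  # 1/4 under the stated assumptions
```

Under these assumptions roughly one offspring in four would be expected to carry both transgenes; homozygous parents or linked insertions would change that figure.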

For convenience, certain terms employed in the specification, examples, and 
appended claims are collected here. 

Additional objects and aspects of the present invention will become more apparent 
upon review of the detailed description set forth below when taken in conjunction with the
accompanying figures, which are briefly described as follows. 

3.1 DEFINITIONS 

The term "animal" as used herein refers to all vertebrate animals, including birds. It 
also includes an individual animal in all stages of development, including embryonic and
fetal stages. 

The term "avian" as used herein refers to any species, subspecies or race of organism
of the taxonomic class Aves, such as, but not limited to, chicken, quail, turkey, duck, goose,
pheasants, parrots, finches, hawks, crows and ratites including ostrich, emu and cassowary.
The term includes the various known strains of Gallus gallus, or chickens (for example,
White Leghorn, Brown Leghorn, Barred-Rock, Sussex, New Hampshire, Rhode Island,
Australorp, Minorca, Amrox, California Gray, Italian Partridge-colored), as well as strains
of turkeys, pheasants, quails, duck, ostriches and other poultry commonly bred in
commercial quantities.

The term "male germ cells" as used herein refers to sperm, sperm cells, spermatozoa
(i.e., male gametes) and developmental precursors thereof. Male germ cells with the
capacity to swim and transfer nucleic acid to an ovum are herein referred to as "viable male
germ cells." In fetal development, primordial germ cells are thought to arise from the
embryonic ectoderm, and are first seen in the epithelium of the endodermal yolk sac at the
E8 stage. From there they migrate through the hindgut endoderm to the genital ridges. In
the sexually mature male vertebrate animal, there are several types of cells that are
precursors of spermatozoa, and which can be genetically modified, including the primitive
spermatogonial stem cells, known as A0/As, which differentiate into type B spermatogonia.
The latter further differentiate to form primary spermatocytes, and enter a prolonged meiotic
prophase during which homologous chromosomes pair and recombine. Useful precursor
cells at several morphological/developmental stages are also distinguishable: preleptotene
spermatocytes, leptotene spermatocytes, zygotene spermatocytes, pachytene spermatocytes,
secondary spermatocytes, and the haploid spermatids. The latter undergo further
morphological changes during spermatogenesis, including the reshaping of their nucleus,
the formation of the acrosome, and assembly of the tail. The final changes in the spermatozoon
(i.e., male gamete) take place in the genital tract of the female, prior to fertilization.

The terms "ovum" and "oocyte" are used interchangeably herein. Although only
one ovum matures at a time, an animal is born with a finite number of ova. In avian
species, such as a chicken, ovulation, which is the shedding of an egg from the ovarian
follicle, occurs when the brain's pituitary gland releases a luteinizing hormone, LH. Mature
follicles form a stalk or pedicel of connective tissue and smooth muscle. Immediately after
ovulation the follicle becomes a thin-walled sac, the postovulatory follicle. The mature
ovum erupts from its sac and starts its journey through the oviduct. Eventually, the ovum
enters the infundibulum where fertilization occurs. Fertilization must take place within 15
minutes of ovulation, before the ovum becomes covered by albumen. During fertilization,
sperm (avians have polyspermic fertilization) penetrate the blastodisc. When the sperm
lodges within this germinal disk, an embryo begins to form as a "blastoderm" or "zygote."

The term "embryonic cells" as used herein refers to cells that are typically single cell
embryos, fertilized or unfertilized, or the equivalent thereof and is meant to encompass
dividing embryos, such as two-cell, four-cell, or even later stages as described by
Eyal-Giladi and Kochav (1976, Dev. Biol. 49: 321-337) and ova 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 12,
14, 16, 18, or 20 hours after the preceding lay. The embryonic cells may be isolated freshly,
maintained in culture, or reside within an embryo.

The term "fragment" as used herein refers to an at least 10, 20, 50, 75, 100, 150,
200, 250, 300, 500, 1000, 2000 or 5000 nucleotide long portion of a nucleic acid (e.g.,
cDNA) that has been constructed artificially (e.g., by chemical synthesis) or by cleaving a
natural product into multiple pieces, using restriction endonucleases or mechanical shearing,
or enzymatically, for example, by PCR or any other polymerizing technique known in the
art, or expressed in a host cell by recombinant nucleic acid technology known to one of skill
in the art. The term "fragment" as used herein may also refer to an at least 5, 10, 20, 30, 40,
50, 75, 100, 150, 200, 250, 300, 400, 500, 1000, 2000 or 5000 amino acid portion of a
polypeptide, which portion is cleaved from a naturally occurring polypeptide by proteolytic
cleavage by at least one protease, or is a portion of the naturally occurring polypeptide
synthesized by chemical methods or using recombinant DNA technology (e.g., expressed
from a portion of the nucleotide sequence encoding the naturally occurring polypeptide)
known to one of skill in the art.

The term "isolated nucleic acid" as used herein refers to a nucleic acid that has been
removed from other components of the cell containing the nucleic acid or from other
components of the chemical/synthetic reaction used to generate the nucleic acid. In specific
embodiments, the nucleic acid is 50%, 60%, 70%, 80%, 90%, 95%, 99% or 100% pure.
The "isolated nucleic acid" is neither (a) identical to that of any naturally occurring nucleic
acid nor (b) identical to that of any fragment of a naturally occurring genomic nucleic acid
spanning more than three separate genes, and includes DNA, RNA, or derivatives or
variants thereof. The term covers, for example, (a) a DNA which has the sequence of part
of a naturally occurring genomic molecule but is not flanked by at least one of the coding
sequences that flank that part of the molecule in the genome of the species in which it
naturally occurs; (b) a nucleic acid incorporated into a vector or into the genomic nucleic
acid of a prokaryote or eukaryote in a manner such that the resulting molecule is not
identical to any vector or naturally occurring genomic DNA; (c) a separate molecule such as
a cDNA, a genomic fragment, a fragment produced by polymerase chain reaction (PCR),
ligase chain reaction (LCR) or chemical synthesis, or a restriction fragment; (d) a
recombinant nucleotide sequence that is part of a hybrid gene, i.e., a gene encoding a fusion
protein; and (e) a recombinant nucleotide sequence that is part of a hybrid sequence that is
not naturally occurring. The techniques used to isolate and characterize the nucleic acids
and proteins of the present invention are well known to those of skill in the art and standard
molecular biology and biochemical manuals may be consulted to select suitable protocols
without undue experimentation. See Sambrook et al., Molecular Cloning: A
Laboratory Manual, 3rd ed., Cold Spring Harbor Press (2001), the content of which is
herein incorporated by reference in its entirety.

By the use of the term "enriched" in reference to nucleic acid it is meant that the
specific DNA or RNA sequence constitutes a significantly higher fraction of the total DNA
or RNA present in the cells or solution of interest than in normal or diseased cells or in the
cells from which the sequence was taken. Enriched does not imply that there are no other
DNA or RNA sequences present, just that the relative amount of the sequence of interest
has been significantly increased, for example, by 1 fold, 2 fold, 5 fold, 10 fold, 50 fold, 100
fold, 500 fold, 1000 fold, 10,000 fold, 100,000 fold or 1,000,000 fold. The other DNA may,
for example, be derived from a yeast or bacterial genome, or a cloning vector, such as a
plasmid or a viral vector.
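
The notion of enrichment as a change in the fraction of total nucleic acid can be made concrete with a small calculation. The following is a minimal sketch, not a method taken from the specification; the function name `fold_enrichment`, its parameters, and the example amounts are hypothetical.

```python
def fold_enrichment(target_after, total_after, target_before, total_before):
    """Ratio of the target's fraction of total nucleic acid after vs. before.

    All four arguments are amounts in the same (arbitrary) unit, e.g. ng.
    """
    if min(total_after, total_before) <= 0 or target_before <= 0:
        raise ValueError("totals and the starting target amount must be positive")
    fraction_after = target_after / total_after
    fraction_before = target_before / total_before
    return fraction_after / fraction_before


# Hypothetical amounts: 1 ng of target in 1000 ng total before enrichment,
# and the same 1 ng of target in 10 ng total afterwards.
example = fold_enrichment(1.0, 10.0, 1.0, 1000.0)
print(round(example))  # 100
```

Note that the absolute amount of the sequence of interest is unchanged in this example; only its share of the total has increased, which is consistent with the statement that other sequences may still be present.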

The term "transcription regulatory sequences" as used herein refers to nucleotide
sequences that are associated with a gene nucleic acid sequence and that regulate the
transcriptional expression of the gene. The "transcription regulatory sequences" may be
isolated and incorporated into a vector nucleic acid to enable regulated transcription in
appropriate cells of portions of the vector DNA. Exemplary transcription regulatory
sequences include enhancer elements, hormone response elements, steroid response
elements, negative regulatory elements, and the like. The "transcription
regulatory sequence" may precede, but is not limited to, the region of a nucleic acid
sequence that is in the region 5' of the end of a protein coding sequence that may be
transcribed into mRNA. Transcriptional regulatory sequences may also be located within a
protein coding region, in regions of a gene that are identified as "intron" regions, or may be
in regions of nucleic acid sequence that are in the region of nucleic acid.

The term "promoter" as used herein refers to the DNA sequence that determines the
site of transcription initiation by an RNA polymerase. A "promoter-proximal element" may
be a regulatory sequence within about 200 base pairs of the transcription start site. A
"magnum-specific" promoter, as used herein, is a promoter that is primarily or exclusively
active in the tubular gland cells of the avian magnum. Useful promoters also include
exogenously inducible promoters. These are promoters that can be "turned on" in response
to an exogenously supplied agent or stimulus, which is generally not an endogenous
metabolite or cytokine. Examples include an antibiotic-inducible promoter, such as a
tetracycline-inducible promoter, a heat-inducible promoter, a light-inducible promoter, or a
laser-inducible promoter. (See, e.g., Halloran et al., 2000, Development 127: 1953-1960;
Gerner et al., 2000, Int. J. Hyperthermia 16: 171-81; Rang and Will, 2000, Nucleic Acids
Res. 28: 1120-5; Hagihara et al., 1999, Cell Transplant. 8: 431-4; Huang et al., 1999, Mol.
Med. 5: 129-37; Forster et al., 1999, Nucleic Acids Res. 27: 708-10; Liu et al., 1998,
Biotechniques 24: 624-8, 630-2; the contents of which have been incorporated herein by
reference in their entireties).

To facilitate manipulation and handling of the nucleic acid to be administered, the
nucleic acid is preferably inserted into a cassette where it is operably linked to a promoter.
The promoter should be capable of driving expression in the desired cells. The selection of
appropriate promoters can be readily accomplished. For some applications, a high
expression promoter is preferred, such as the cytomegalovirus (CMV) promoter. Other
promoters useful in the present invention include the Rous Sarcoma Virus (RSV) promoter
(Davis et al., 1993, Hum. Gene Therap. 4:151). In other embodiments, all or a portion of
the, for example, lysozyme, ovomucoid, albumin, conalbumin or ovotransferrin promoters,
which direct expression of proteins present in egg white, are used, as detailed infra, or
synthetic promoters such as the MDOT promoter described infra.

The term "expressed" or "expression" as used herein refers to the transcription from
a gene to give an RNA nucleic acid molecule complementary at least in part to a region of
one of the two nucleic acid strands of the gene. The term "expressed" or "expression" as
used herein also refers to the translation from said RNA nucleic acid molecule to give a
protein or polypeptide or a portion thereof.

The term "matrix attachment regions" as used herein refers to DNA sequences
having an affinity or intrinsic binding ability for the nuclear scaffold or matrix. The MAR
elements of the chicken lysozyme locus were described by Phi-Van et al., 1996, E.M.B.O. J.
76:665-664 and Phi-Van, L. and Stratling, W.H., 1996, Biochem. 35:10735-10742;
incorporated herein by reference in their entireties.

The term "nucleic acid vector" as used herein refers to a natural or synthetic single
or double stranded plasmid or viral nucleic acid molecule, or any other nucleic acid
molecule, such as but not limited to YACs, BACs, bacteriophage-derived artificial
chromosome (BBPAC), cosmid or P1-derived artificial chromosome (PAC), that can be
transfected or transformed into cells and replicate independently of, or within, the host cell
genome. A circular double stranded vector can be linearized by treatment with an
appropriate restriction enzyme based on the nucleotide sequence of the vector. A nucleic
acid can be inserted into a vector by cutting the vector with restriction enzymes and ligating
the pieces together. The nucleic acid molecule can be RNA or DNA.
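
Choosing a restriction enzyme "based on the nucleotide sequence of the vector" amounts to finding an enzyme whose recognition site occurs exactly once in the circular sequence. The sketch below illustrates that idea on the top strand only; it is an assumption-laden illustration rather than a protocol from the specification. The function name `linearize`, the toy 10-nucleotide "plasmid", and the use of the EcoRI site (GAATTC, cut after the first G) are chosen purely for the example, and overhangs on the complementary strand are ignored.

```python
def linearize(circular_seq, site, cut_offset):
    """Cut a circular sequence at its single occurrence of `site`.

    `cut_offset` is the number of bases into the site at which the top
    strand is cut (1 for EcoRI, G^AATTC). Returns the linear top strand.
    Raises ValueError unless the site occurs exactly once.
    """
    seq = circular_seq.upper()
    n = len(seq)
    # Append the start of the sequence so that sites spanning the
    # arbitrary origin of the circular molecule are also found.
    wrapped = seq + seq[: len(site) - 1]
    hits = [i for i in range(n) if wrapped[i : i + len(site)] == site]
    if len(hits) != 1:
        raise ValueError(f"expected exactly one {site} site, found {len(hits)}")
    cut = (hits[0] + cut_offset) % n
    return seq[cut:] + seq[:cut]


# Toy circular sequence containing one EcoRI site.
linear = linearize("AAGAATTCTT", "GAATTC", 1)
print(linear)  # AATTCTTAAG
```

An enzyme with zero sites leaves the vector circular, and one with several sites fragments it, which is why the sketch rejects both cases.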

The term "expression vector" as used herein refers to a nucleic acid vector that
comprises regulatory sequences operably linked to a nucleotide sequence coding for at least
one polypeptide. As used herein, the term "regulatory sequences" includes promoters,
enhancers, and other elements that may control expression. Standard molecular biology
textbooks such as Sambrook et al. (supra) and Lodish et al., eds., "Molecular Cell Biology,"
Freeman (2000), incorporated herein by reference in their entireties, may be consulted to
design suitable expression vectors, promoters, and other expression control elements. It
should be recognized, however, that the choice of a suitable expression vector depends upon
multiple factors including the choice of the host cell to be transformed and/or the type of
protein to be expressed. Also useful for various applications are tissue-selective (i.e.,
tissue-specific) promoters, i.e., promoters from which expression occurs preferentially in cells of a
particular kind of tissue, compared to one or more other types of tissue. For example,
chicken oviduct-specific promoters naturally associated with the proteins of avian egg
whites including, but not limited to, lysozyme, ovomucoid, albumin, conalbumin, and
ovotransferrin may be used.

The term "recombinant cell" refers to a cell that has a new combination of nucleic
acid segments that are not covalently linked to each other in nature in that particular
configuration. A new combination of nucleic acid segments can be introduced into an
organism using a wide array of nucleic acid manipulation techniques available to those
skilled in the art. A recombinant cell can be a single eukaryotic cell, or a single prokaryotic
cell or a mammalian cell. The recombinant cell may harbor a vector that is extragenomic.
An extragenomic nucleic acid vector does not insert into the cell's genome. A recombinant
cell can further harbor a vector or a portion thereof (e.g., the portion containing the
regulatory sequences and the coding sequence) that is intragenomic. The term
"intragenomic" defines a nucleic acid construct incorporated within the recombinant cell's
genome.

The terms "recombinant nucleic acid" and "recombinant DNA" as used herein refer
to a combination of at least two nucleic acid sequences that is not naturally found in a
eukaryotic or prokaryotic cell in that particular configuration. The nucleic acid sequences
may include, but are not limited to, nucleic acid vectors, gene expression regulatory
elements, origins of replication, sequences that when expressed confer antibiotic resistance,
and protein-encoding sequences. The term "recombinant polypeptide" is meant to include a
polypeptide produced by recombinant DNA techniques such that it is distinct from a
naturally occurring polypeptide either in its location, purity or structure. Generally, such a
recombinant polypeptide will be present in a cell in an amount different from that normally
observed in nature.

As used herein, the term "transgene" refers to a nucleic acid sequence (encoding, for
example, a human interferon polypeptide) that is partly or entirely heterologous, i.e.,
foreign, to the transgenic animal or cell into which it is introduced, or, is homologous to an
endogenous gene of the transgenic animal or cell into which it is introduced, but which is
designed to be inserted, or is inserted, into the animal's genome in such a way as to alter the
genome of the cell into which it is inserted (e.g., it is inserted at a location which differs
from that of the natural gene or its insertion results in a knockout). A transgene also
includes a regulatory sequence designed to be inserted into the genome such that it regulates
the expression of an endogenous coding sequence, e.g., to increase expression and/or to
change the timing and/or tissue specificity of expression, etc. (e.g., to effect "gene
activation").

The term "transgenic animal" as used herein refers to an animal, including an avian
species such as a chicken, in which one or more of the cells of the animal contains
heterologous nucleic acid introduced by way of human intervention, such as by transgenic
techniques well known in the art or by the methods of the present invention.
As used herein, a "transgenic avian" is any avian species, including but not limited
to, chicken, turkey, duck, goose, quail, pheasants, parrots, finches, hawks, crows and ratites
including ostrich, emu and cassowary, in which one or more of the cells of the avian may
contain heterologous nucleic acid introduced by way of human intervention, such as by
transgenic techniques known in the art, and particularly, as described herein. The nucleic
acid is introduced into a cell, directly or indirectly by introduction into a precursor of the
cell, by way of deliberate genetic manipulation, such as by microinjection or by infection
with a recombinant virus. The term genetic manipulation does not include classical
cross-breeding, or in vitro fertilization (although it does include fertilization with sperm into
which a transgene has been introduced), but rather is directed to the introduction of a
recombinant DNA molecule. This molecule may be integrated within a chromosome, or it
may be extrachromosomally replicating DNA. In the typical transgenic avian, the transgene
causes cells to express a recombinant form of the subject polypeptide, e.g., either agonistic
or antagonistic forms, or a form in which the gene has been disrupted.

The terms "chimeric animal" or "mosaic animal" are used herein to refer to animals
in which the recombinant gene is found, or in which the recombinant gene is expressed, in some
but not all cells of the animal. The term "tissue-specific chimeric animal" indicates that the
polypeptide encoding gene is present and expressed in some tissues, but not others.

The term "knock-in animal" refers to an animal that carries a specific nucleic acid
sequence such as a "knock-in sequence" in a predetermined coding or noncoding region,
wherein the knock-in sequence is introduced through methods of recombination, such as
homologous recombination. The recombination event comprises replacing all or part of a
gene of the animal by a functional homologous gene or gene segment of another animal,
where the respective knock-in sequence is placed in the genomic sequence.

The term "chromosomal positional effect (CPE)" as used herein refers to the
variation in the degree of gene transcription as a function of the location of the transcribed
locus within the cell genome. Random transgenesis may result in a transgene being inserted
at different locations in the genome so that individual cells of a population of transgenic
cells may each have at least one transgene, each at a different location and therefore each in
a different genetic environment. Each cell, therefore, may express the transgene at a level
specific for that particular cell and dependent upon the immediate genetic environment of
the transgene. In a transgenic avian, as a consequence, different tissues may exhibit
different levels of transgene expression.

The term "cytokine" as used herein refers to any secreted polypeptide that affects the
functions of cells and is a molecule that modulates interactions between cells in the
immune, inflammatory or hematopoietic response. A cytokine includes, but is not limited
to, monokines and lymphokines regardless of which cells produce them. For instance, a
monokine is generally referred to as being produced and secreted by a mononuclear cell,
such as a macrophage and/or monocyte. Many other cells however also produce monokines,
such as natural killer cells, fibroblasts, basophils, neutrophils, endothelial cells, brain
astrocytes, bone marrow stromal cells, epidermal keratinocytes and B-lymphocytes.
Lymphokines are generally referred to as being produced by lymphocyte cells. Examples of
cytokines include, but are not limited to, Interleukin-1 (IL-1), Interleukin-6 (IL-6),
Interleukin-8 (IL-8), Tumor Necrosis Factor-alpha (TNF-alpha) and Tumor Necrosis
Factor-beta (TNF-beta).

The term "antibody" as used herein refers to polyclonal and monoclonal antibodies
and fragments thereof, and immunologic binding equivalents thereof. The term "antibody"
refers to a homogeneous molecular entity, or a mixture such as a polyclonal serum product
made up of a plurality of different molecular entities, and may further comprise any
modified or derivatised variant thereof that retains the ability to specifically bind an epitope.
A monoclonal antibody is capable of selectively binding to a target antigen or epitope.
Antibodies may include, but are not limited to, polyclonal antibodies, monoclonal antibodies
(mAbs), humanized or chimeric antibodies, single chain antibodies, Fab fragments, F(ab')
fragments, disulfide-linked Fvs (sdFv), fragments produced by a Fab expression library,
anti-idiotypic (anti-Id) antibodies, intrabodies, synthetic antibodies, and epitope-binding
fragments of any of the above.

The term "immunoglobulin polypeptide" as used herein refers to a polypeptide
derived from a constituent polypeptide of an immunoglobulin. An "immunoglobulin
polypeptide" may be, but is not limited to, an immunoglobulin (preferably an antibody)
heavy or light chain and may include a variable region, a diversity region, joining region and
a constant region or any combination, variant or truncated form thereof. The term
"immunoglobulin polypeptide" further includes single-chain antibodies comprised of, but
not limited to, an immunoglobulin heavy chain variable region, an immunoglobulin light
chain variable region and optionally a peptide linker.

The term "origin of replication" (ori) as used herein refers to unique regions of a
nucleic acid sequence containing multiple short repeated sequences, recognized by
multimeric origin of replication binding proteins that organize the assembly of multiple
enzymes and proteins required for the replication of the nucleic acid. The origin of
replication derived from E. coli may be included in a plasmid for replication of the plasmid
in a bacterial host. The SV40 viral ori is a 65 bp region derived from the SV40 viral
chromosome that when included in a nucleic acid sequence will allow replication of the
nucleic acid in an animal cell. Inclusion of the SV40 ori region in a plasmid that also has
the E. coli ori element will allow the plasmid to be replicated in both a bacterial host and in
an animal cell.

The term "centromere" as used herein refers to a small, specialized region of a 
chromosome recognized as a constriction in a condensed chromosome. A kinetochore lies
within the centromeric region and is attached to microtubules extending to the poles of a 
dividing cell. 

The term "telomere" as used herein refers to repetitive oligomeric nucleic acid 
sequences located at the ends of linear eukaryotic chromosomes. Telomeres are required to 
prevent shortening of chromosomal DNA during replication of the linear nucleic acid.

Recombinant expression vectors can be designed for the expression of the encoded
proteins in eukaryotic cells. Useful vectors may comprise constitutive or inducible promoters
to direct expression of either fusion or non-fusion proteins. With fusion vectors, a number
of amino acids are usually added to the expressed target gene sequence such as, but not
limited to, a protein sequence for thioredoxin. A proteolytic cleavage site may further be
introduced at a site between the target recombinant protein and the fusion sequence.
Additionally, a region of amino acids, such as a polymeric histidine region, may be
introduced to allow binding of the fusion protein to metallic ions such as nickel bonded to a
solid support, and thereby allow purification of the fusion protein. Once the fusion protein
has been purified, the cleavage site allows the target recombinant protein to be separated
from the fusion sequence. Enzymes suitable for use in cleaving the proteolytic cleavage site
include, but are not limited to, Factor Xa and thrombin. Fusion expression vectors that may
be useful in the present invention include pGex (Amrad Corp., Melbourne, Australia),
pRIT5 (Pharmacia, Piscataway, NJ) and pMAL (New England Biolabs, Beverly, MA), that
fuse glutathione S-transferase [...] to the
target recombinant protein.

Expression of a foreign gene can be obtained using eukaryotic host cells such as, but
not limited to, mammalian or avian cells. The use of eukaryotic host cells permits partial or
complete post-translational modification such as, but not only, glycosylation and/or the
formation of the relevant inter- or intra-chain disulfide bonds. Examples of vectors useful
for expression in the chicken Gallus gallus include pYepSec1 as in Baldari et al.,
E.M.B.O. J. 6, 229-234 (1987) and pYES2 (Invitrogen Corp., San Diego, CA), incorporated
herein by reference in their entireties. Once the isolated DNA molecule of the present
invention has been cloned into an expression system, it is ready to be incorporated into a
host cell.

The terms "transformation" and "transfection" as used herein refer to the process of
inserting a nucleic acid into a cell. Many techniques are well known to those skilled in the
art to facilitate transformation or transfection of a nucleic acid into a prokaryotic or
eukaryotic organism. These methods involve a variety of techniques, such as treating the
cells with high concentrations of salt such as, but not only, a calcium or magnesium salt, an
electric field, detergent, or liposome mediated transfection, to render the host cell competent
for the uptake of the nucleic acid molecules, and by such methods as sperm-mediated and
restriction-mediated integration.

The term "transfecting agent" as used herein refers to a composition of matter added
to the genetic material for enhancing the uptake of heterologous DNA segment(s) into a
eukaryotic cell, preferably an avian cell, and more preferably a chicken male germ cell. The
enhancement is measured relative to the uptake in the absence of the transfecting agent.
Examples of transfecting agents include adenovirus-transferrin-polylysine-DNA complexes.
These complexes generally augment the uptake of DNA into the cell and reduce its
breakdown during its passage through the cytoplasm to the nucleus of the cell. These
complexes can be targeted to the male germ cells using specific ligands that are recognized
by receptors on the cell surface of the germ cell, such as the c-kit ligand or modifications
thereof.

Other preferred transfecting agents include, but are not limited to, Lipofectin,
Lipofectamine, DMRIE-C, Superfect, and Effectene;
DOGS (Transfectam; dioctadecylamidoglycylspermine), DOPE (1,2-dioleoyl-sn-glycero-3-phosphoethanolamine),
DOTAP (1,2-dioleoyl-3-trimethylammonium propane), DDAB
(dimethyl dioctadecylammonium bromide), DHDEAB (N,N-di-n-hexadecyl-N,N-dihydroxyethyl
ammonium bromide), HDEAB (N-n-hexadecyl-N,N-dihydroxyethylammonium
bromide), polybrene, or poly(ethylenimine) (PEI). These
nonviral agents have the advantage that they can facilitate stable integration of xenogeneic
DNA sequences into the vertebrate genome, without size restrictions commonly associated
with virus-derived transfecting agents.

The terms "intracytoplasmic sperm injection" and "ICSI" as used herein refer to
delivering an exogenous nucleic acid to a recipient cell by associating the exogenous
nucleic acid with the head of a sperm cell and then delivering the sperm cell head to the
recipient cell by microinjection. The exogenous nucleic acid may be integrated into the
endogenous genomic nucleic acid of the sperm, non-integrated as an episomal element of
the nucleic acid complement of the sperm head, or linked internally or externally to the head
of the sperm. The terms "chICSI" and "CHICSI™" as used herein refer to intracytoplasmic
sperm injection into a chicken cell.

The terms "sub-zonal injection" and "SUZI" refer to delivering viable spermatozoa
to an oocyte by microinjection, wherein the sperm are microinjected between the zona 
pellucida and the cytoplasmic membrane of an oocyte. 
The term "gene delivery (or transfection) mixture" as used herein, in the context of
the methods of sperm mediated transfer described herein, refers to selected genetic material
in an appropriate vector mixed, for example, with an effective amount of lipid transfecting
agent, for example, a cationic or polycationic lipid, such as polybrene. The amount of each
component of the mixture is chosen so that the genetic modification, e.g., by transfection or
transduction, of a specific species of male germ cell is optimized. Such optimization
requires no more than routine experimentation. The ratio of DNA to lipid is broad,
preferably about 1:1, although other proportions can also be utilized depending on the type
of lipid transfecting agent used.

This application uses gene nomenclature accepted by the Cucurbit Genetics 
Cooperative as it appears in the Cucurbit Genetics Cooperative Report 18:85 (1995); herein
incorporated by reference in its entirety. Using this gene nomenclature, genes are 
symbolized by italicized Roman letters. If a mutant gene is recessive to the normal type, 
then the symbol and name of the mutant gene appear in italicized lower case letters. 

3.2 ABBREVIATIONS

Abbreviations used in the present specification include the following: aa, amino
acid(s); bp, base pair(s); cDNA, DNA complementary to RNA; mRNA, messenger RNA;
tRNA, transfer RNA; nt, nucleotide(s); SSC, sodium chloride-sodium citrate; MAR, matrix
attachment region; DMSO, dimethyl sulfoxide; TPLSM, two photon laser scanning
microscopy; REMI, restriction enzyme mediated integration; WEFs, whole embryo
fibroblasts.

4. BRIEF DESCRIPTION OF THE FIGURES

FIGS. 1A-E illustrate the nucleotide sequence (SEQ ID NO: 6) comprising the
chicken lysozyme gene expression control region (SEQ ID NO: 7), the nucleotide sequence
encoding the chicken expression optimized human interferon α2b (IFNMAGMAX; SEQ ID
NO: 5) and a SV40 polyadenylation signal sequence (SEQ ID NO: 8).

FIG. 2 illustrates the nucleotide sequence SEQ ID NO: 5 encoding the chicken
expression optimized human interferon α2b (IFNMAGMAX).

FIGS. 3A-E illustrate the nucleotide sequence SEQ ID NO: 7 encoding the chicken 
lysozyme gene expression control region. 

FIG. 4 illustrates the nucleotide sequence SEQ ID NO: 8 encoding the SV40
polyadenylation signal sequence.

FIGS. 5A-C illustrate the nucleotide sequence SEQ ID NO: 9 encoding the chicken 
lysozyme 3' domain. 

FIGS. 6A-J illustrate the nucleotide sequence SEQ ID NO: 10 encoding the
lysozyme gene expression control region (SEQ ID NO: 7) linked to the insert having the
nucleotide sequence of SEQ ID NO: 5 encoding the chicken expression-optimized human
interferon α2b (IFNMAGMAX) and the chicken lysozyme 3' domain SEQ ID NO: 9.

FIG. 7 illustrates the nucleotide sequence SEQ ID NO: 11 of the combinatorial
promoter MDOT.

FIGS. 8A-B illustrate the oligonucleotides and primers (SEQ ID NOS: 14-31) used
in the formation of the chicken codon optimized ... acid.

FIG. 9 illustrates the primers (SEQ ID NOS: 32-35) used in the synthesis of the 
MDOT promoter. 

FIG. 10 illustrates the level of human monoclonal IgG antibodies expressed in the
serum of transgenic chicks, as determined by ELISA.

FIG. 11 illustrates the detection of EGFP positive bands from transgenic sperm.

5. DETAILED DESCRIPTION OF THE INVENTION

The present invention relates to methods of introducing nucleic acids into avian
oocytes by sperm-mediated transfection to produce a transgenic chicken or quail, or other
avian species, carrying the transgene in the genetic material in all or most of its tissue,
including germ-line tissue. The methods and vectors of the present invention further
generate transgenic avians that express heterologous genes whose products are present in
the serum of the avian and/or are deposited into an avian egg, preferably in the egg white.
Vectors containing promoters that direct a high level of expression of the heterologous
protein in the avian, particularly in the magnum for deposition into the avian egg, are
provided. Additional regulatory elements, such as MARs, IRESs, enhancers,
polyadenylation signals, etc., may be included in the vectors of the invention to improve
expression and efficiency.

5.1 METHODS OF TRANSGENESIS
5.1.1 SPERM-MEDIATED INTEGRATION OF HETEROLOGOUS TRANSGENES

The transgenic avians of the present invention are most preferably generated using
sperm-mediated transfection of nucleic acid into avian oocytes. Specifically, the present
invention provides methods for introducing nucleic acids containing a transgene, preferably
a nucleic acid vector of the invention as described in Section 5.2, infra, into an avian oocyte
by sperm-mediated transfection. In preferred embodiments, the nucleic acid is first
introduced into an avian sperm in vitro by lipofection, electroporation, restriction enzyme
mediated integration (REMI) or similar methods, or in vivo by microinjection into the testis.
The modified avian sperm is then delivered to an avian oocyte either by natural coitus (after
the modified avian sperm are returned to the testis of a male avian, or in the method in which
the nucleic acid has been injected directly into the testis) or in vitro by microinjection,
intracytoplasmic sperm injection (ICSI) or artificial insemination of oocytes isolated from
an ovulating female bird, thereby generating a transgenic zygote and chick. In certain
embodiments, the male germ cells are irradiated, more preferably irradiated by gamma rays,
before the heterologous nucleic acid is incorporated therein. In other embodiments, the
testis is depopulated of sperm prior to introduction of the transfected sperm.

The present invention contemplates that any technique capable of transferring
heterologous material into sperm could be used so long as the technique preserves enough
of the sperm's fertilization functions, such that the resultant sperm will be able to fertilize
the oocyte. It is understood that the heterologous nucleic acid may be integrated into the
genome of a recipient cell such as a spermatogonial cell or a spermatogonial precursor cell






for subsequent transfer to an embryo or the testicular material of the recipient male animal, 
preferably a chicken. It is further understood that the heterologous nucleic acid may not be 
integrated into the genome of the recipient cell but delivered as an episome which may or 
may not be integrated into the genome of the resulting zygote or chick. 


5.1.1.1 PREPARATION OF TRANSGENIC CONSTRUCT 

One aspect of the present invention relates to the preparation of a transgene which is
to be incorporated into the genome of an avian sperm. In certain embodiments, the
transgene comprises at least one heterologous nucleic acid. It is contemplated to be within
the scope of the present invention for the heterologous nucleic acid to comprise an
expression vector such as, but not limited to, viral vectors, plasmid vectors, or linearized
nucleic acid vectors or a combination thereof. (See Section 5.2, infra, for details on vectors,
and the preparation thereof). The expression vector may particularly be any suitable
nonviral vector including plasmid DNA, bacterial artificial chromosomes (BACs), yeast
artificial chromosomes (YACs), etc. The expression vector may also be any suitable viral
vector, for example, retroviral vectors, adenoviral vectors, transferrin-polylysine enhanced
adenoviral vectors, human immunodeficiency virus vectors, lentiviral vectors, Moloney
murine leukemia virus-derived vectors, and virus-derived DNAs that facilitate
polynucleotide uptake by and release into the cytoplasm of germ cells.
The transcriptional promoter of an expression vector of the present invention may be a
constitutively active promoter such as the cytomegaloviral promoter or Rous sarcoma virus
promoter, or a tissue-specific promoter, preferably a tissue-specific promoter operable in
oviduct cells of an avian species including, but not limited to, the promoters of the genes
encoding ovalbumin, lysozyme, ovomucoid, ovotransferrin, conalbumin, and ovomucin.
Optionally, the transcriptional promoter of an expression vector may be a regulatable
promoter. The expression vector may further comprise a region encoding a transcriptional
terminator, such as a bovine growth hormone transcriptional terminator.

In preferred embodiments, a transgene construct comprises at least two separate or
independent elements. A first element could comprise an oviduct-specific promoter, such
as, but not limited to, ovalbumin, lysozyme, ovomucoid, ovotransferrin, conalbumin, and
ovomucin, which would drive expression of a gene coding for a protein of interest in the
oviduct. A second element can be located either upstream or downstream of the first
element and comprises a protamine promoter, or a segment thereof that is sufficient to drive
the expression of a marker gene such as the Green Fluorescent Protein (GFP) to facilitate
identification of transfected sperm.




In one embodiment of the present invention, the heterologous nucleic acid comprises
cohesive ends characterized as capable of hybridizing to cohesive ends generated by a
restriction endonuclease. The cohesive ends on the nucleic acid may be generated by
restriction endonuclease cleavage of a circular or linear nucleic acid, by the chemical
addition of nucleotides to the ends of a linear nucleic acid, or by a combination of chemical
and enzymatic methods.

In another embodiment of the present invention, the heterologous nucleic acid is
linearized and has at least one blunt end. The blunt end of the nucleic acid may be
generated by an exonuclease digestion of cohesive ends, such as with S1 nuclease.

In the methods of generating transgenic cells, the genomic nucleic acid of the
recipient cell, male germ cell or oocyte can be cleaved to
receive the integrating heterologous nucleic acid. Any method may be selected that will
generate limited, random cleavage that will allow integration of the heterologous nucleic
acid into the genome of the recipient cell or oocyte. When the integrating heterologous
nucleic acid has cohesive ends, the recipient genomic nucleic acid may be cleaved with a
restriction endonuclease generating cohesive ends capable of hybridizing to the cohesive
ends of the heterologous nucleic acid. When the heterologous nucleic acid has blunt ends,
the genomic nucleic acid can be cleaved by any method that will generate blunt ends at the
cleavage site, including restriction endonuclease cleavage, or irradiation of the cell with
high-energy irradiation. Suitable radiations that may be applied to the methods of the
present invention include, for example, gamma rays, x-rays, ultraviolet light or ultrasound.
It is contemplated that the cleavage of genomic nucleic acid and integration of a
heterologous nucleic acid therein may either result in a viable recipient cell that can be used
to fertilize an avian oocyte, or fail to yield a viable cell. A non-viable sperm cell may,
however, be used to deliver the transgene to an oocyte using, for example, the ICSI
(CHICSI™) method.

The heterologous nucleic acid of the present invention may further comprise a
centromere element and at least one telomere element. In one embodiment, the centromere
and the at least one telomere are derived from the chicken. While the origin of replication
(ori) site alone will
allow replication of the heterologous nucleic acid when transfected into an oocyte or zygote
thereof, segregation of the replicates into each daughter cell will require the optional
centromeric element. In the absence of this centromeric element, segregation will be
random between daughter cells, with some daughter cells not receiving a copy of the
transgenic nucleic acid. A mosaic transgenic animal would, therefore, result.
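
The effect of random segregation can be illustrated with a toy calculation. The sketch below is not part of the disclosed methods. It assumes, purely for illustration, that each episome copy present at cell division goes to either daughter cell independently and with equal probability; the function name `prob_some_daughter_empty` is hypothetical.

```python
from fractions import Fraction


def prob_some_daughter_empty(copies_at_division: int) -> Fraction:
    """Probability that one of the two daughter cells receives no copy.

    Assumption (illustrative only): every copy is assigned to a daughter
    independently with probability 1/2, i.e. there is no centromere to
    enforce one-copy-per-daughter segregation.
    """
    if copies_at_division < 1:
        raise ValueError("at least one copy is required")
    # Each daughter is empty with probability (1/2)**n; the two events
    # cannot both occur when n >= 1, so the probabilities add.
    return 2 * Fraction(1, 2) ** copies_at_division


# One episome that has replicated once gives two copies at division.
print(prob_some_daughter_empty(2))
```

Under this assumed model the chance of an empty daughter cell falls as the copy number rises but never reaches zero, which is consistent with the mosaicism described above.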

In one embodiment of the present invention, therefore, the heterologous nucleic acid
is an artificial chromosome comprising a heterologous transgenic element having the
properties desired to be expressed by a transgenic animal, an origin of replication (ori) site,
and a centromere. In this embodiment, the heterologous nucleic acid may be a circular
nucleic acid or a linear nucleic acid. In another embodiment, the heterologous nucleic acid
is a linear nucleic acid further comprising telomeres.

In another aspect of the methods according to the present invention, the transgenic
oocyte or ovum of the present invention is incubated for development of the zygote therein
to a fetus, and subsequently to a chick for hatching. In one embodiment of the present
invention, therefore, the zygote is incubated in a surrogate avian female, wherein the
method comprises the steps of fistulating an avian female, delivering the avian oocyte to the
infundibulum of the female bird, allowing the avian female to incubate the avian oocyte to
an embryo within an egg, allowing the avian female to lay the egg, and allowing the embryo
to hatch as a viable chick, wherein the chick is a transgenic chick having an exogenous
nucleic acid incorporated therein.

5.1.1.2 SPERM TRANSGENESIS

The heterologous nucleic acid may be delivered to an avian male germ cell (i.e.,
sperm, spermatozoon cell or a precursor cell) by a method comprising: contacting the male
germ cell with a gene delivery mixture comprising a nucleic acid, either a eukaryotic viral
vector or a vector that is not derived from a eukaryotic virus, at about or below the avian's
body temperature and for an effective period of time such that the nucleic acid is
incorporated into the cell, and preferably into the genome of the cell; optionally isolating or
selecting the genetically modified cell with the aid of a genetic selection marker expressed
in the genetically modified cell; and transferring the isolated or selected genetically modified
germ cell to a testis of a recipient male avian such that the cell lodges in a seminiferous
tubule of the testis. A genetically modified male gamete may be produced therein, and
breeding the recipient male avian with a female avian of its species will generate transgenic
progeny that carry the heterologous transgenic nucleic acid in their genomes.

In certain embodiments, the avian male germ cells are isolated and removed from a
male avian. The avian male germ cells are then transfected by introducing the heterologous
nucleic acid into the genome of the avian male germ cells by lipofection, electroporation,
restriction enzyme mediated integration (REMI) or similar methods. In certain other
embodiments, the heterologous nucleic acid is injected directly into the testis of the male
avian for transfection. Male germ cells can be extracted to determine whether transfection
has occurred or the extent of transfection. The male avian can be mated with a female avian
to produce transgenic offspring, or the sperm can be used for IVF.

The precursor cell may be selected from the group consisting of spermatogonial
stem cells, type B spermatogonia, primary spermatocytes, preleptotene spermatocytes,
leptotene spermatocytes, zygotene spermatocytes, pachytene spermatocytes, secondary
spermatocytes, and spermatids. The embodiment further comprises the steps of
incorporating the heterologous transgene into the genome of the spermatozoon cell or the
precursor cell, so that a genetically modified male gamete is produced by the male avian,
and breeding the male avian with a female of the same species such that a transgenic
progeny is thereby produced that carries the polynucleotide in its genome.

In certain embodiments, the heterologous genetic material may be introduced into
the genome of an avian male germ cell, such that a polynucleotide is delivered using known
gene delivery systems to male germ cells in situ in the testis of the male avian (e.g., by in
vivo transfection or transduction). In one embodiment, the invention relates to an in vitro
method of incorporating heterologous genetic material into the genome of a male avian by
isolating male germ cells ex corpora, delivering a polynucleotide thereto, and then returning
the transfected cells to the testes of a recipient male bird. In yet another embodiment, the in
vitro method involves microinjecting the recombinant male germ cells into a recipient
fertilized oocyte, whereupon the sperm head enters the oocyte nucleus to deliver the
heterologous nucleic acid thereto.

In a preferred embodiment, the invention relates to an in vivo method that injects a
gene delivery mixture, preferably into the seminiferous tubules, or into the testis, and most
preferably into the vas efferens or vasa efferentia using, for example, a micropipette and a
picopump delivering a precise measured volume under controlled amounts of pressure. The
modified germ cells differentiate in their own milieu. Progeny animals exhibiting the
nucleic acid's integration into their germ cells (i.e., transgenic animals) are selected. The
selected progeny can then be mated, or their sperm utilized for insemination or in vitro
fertilization, to produce further generations of transgenic progeny or for microinjection into
isolated oocytes.

In another preferred embodiment, the invention relates to an in vitro method wherein
male germ cells are obtained or collected from a donor male avian, by any means known in
the art such as, for example, transection of the testes. The male germ cells are then exposed
to a gene delivery mixture, preferably within several hours of collection, or cryopreserved
for later use. When the male germ cells are obtained from the donor avian by transection of
the testes, the cells can be incubated in an enzyme mixture known for gently breaking up the
tissue matrix and releasing undamaged cells. Suitable enzymes to disrupt the integrity of a
tissue include, but are not limited to, pancreatic trypsin, collagenase type I, and pancreatic
DNAse type I, used together with bovine serum albumin and a modified DMEM medium. After
washing the cells, they can be placed in an incubation medium such as DMEM or the like,
and plated on a culture dish for genetic modification by exposure to a gene delivery mixture.

In other embodiments, a transgene can be incorporated into an avian sperm by
lipofection, electroporation, restriction enzyme mediated integration (REMI),
intracytoplasmic sperm injection (ICSI) or similar methods.

Liposome

In a preferred embodiment, a transgene is incorporated into an avian sperm by
liposomes. The male germ cells, which may be intact and viable spermatozoa, or the
non-viable heads thereof, may be transferred to a recipient oocyte using liposome-mediated
delivery. PCT Publication WO 87/05325, which is incorporated by reference herein in its
entirety, discloses a method of transferring organic and/or inorganic material into sperm or
egg cells by using liposomes. The heterologous nucleic acid can also be incorporated into a
sperm using Lipofectin-based liposomes. (See, e.g., Bachiller et al., 1991, Mol.
Reprod. Develop. 30: 194-200; Nakanishi and Iritani, 1993, Mol. Reprod. Develop. 36:
258-261).

Electroporation 

In another preferred embodiment, a transgene is incorporated into an avian sperm by
electroporation. The application of electrical current has been shown to enhance the uptake
of exogenous DNA fragments by cultured cells. Enhancement of nuclear uptake of the
heterologous DNA will promote earlier chromosomal integration of the exogenous DNA
molecules, thus reducing the degree of genetic mosaicism observed in transgenic avian
founders.

In one embodiment, the male germ cells are placed in a cuvette and a solution of the
transgenic nucleic acid coding the protein of interest is added. A direct current pulse is
discharged in the cuvette suspension. The current pulse creates temporary, short-lived pores
in the cell membrane and allows the male germ cells to take up the transgene while only
slightly compromising cell viability. Further description of the use of electroporation to
incorporate DNA can be found in Gagne et al., 1991, Mol. Reprod. Develop. 29: 6-15,
which is incorporated herein by reference in its entirety.

Restriction Enzyme Mediated Integration (REMI)

In yet another preferred embodiment, a transgene is incorporated into an avian sperm
by restriction enzyme mediated integration (REMI). The heterologous nucleic acid to be
integrated into, for example, the sperm nuclear DNA is converted to a linear double
stranded DNA possessing single-stranded cohesive ends by contacting the heterologous
DNA with a type II restriction enzyme that, upon scission, generates such ends. The nucleic
acid to be cut can be a circular nucleic acid such as in a plasmid or a viral vector or a linear
nucleic acid that possesses at least one recognition and cutting site outside of the genes or
regulatory regions critical to the desired post-integration function of the nucleic acid, and no
recognition and cutting sites within the critical regions.
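
The site-placement condition just stated (at least one recognition site outside the critical regions and none inside them) can be checked in silico before an enzyme is chosen. The following is a minimal sketch under stated assumptions, not a method from this specification: the function names `find_sites` and `site_is_usable` are hypothetical, the sequence is treated as linear, coordinates are 0-based half-open intervals, and only the given strand is scanned, which is sufficient for palindromic recognition sequences such as GAATTC.

```python
def find_sites(seq: str, site: str) -> list[int]:
    """Return the 0-based start position of every occurrence of `site`."""
    seq, site = seq.upper(), site.upper()
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start)
        start = seq.find(site, start + 1)  # allow overlapping matches
    return positions


def site_is_usable(seq: str, site: str,
                   critical_regions: list[tuple[int, int]]) -> bool:
    """True if `site` occurs at least once and never overlaps a critical region.

    `critical_regions` holds (start, end) half-open intervals covering the
    genes or regulatory regions that must stay intact.
    """
    hits = find_sites(seq, site)
    if not hits:
        return False  # the enzyme would not cut the construct at all
    for pos in hits:
        end = pos + len(site)
        for region_start, region_end in critical_regions:
            if pos < region_end and end > region_start:
                return False  # a cut would fall inside a critical region
    return True


# Hypothetical construct: flanking sequence followed by a short "critical" block.
construct = "TTGAATTCAA" + "ATGCCCGGGTAA"
critical = [(10, 22)]
print(site_is_usable(construct, "GAATTC", critical))  # site lies in the flank
print(site_is_usable(construct, "CCCGGG", critical))  # site lies in the block
```

A circular vector would additionally need a check for sites spanning the arbitrary start coordinate, which this sketch does not attempt.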

Alternatively, the heterologous DNA to be integrated into the sperm nuclear DNA
can be prepared by chemically and/or enzymatically adding cohesive ends to a linear DNA.
The added cohesive ends must be able to hybridize to the cohesive ends characteristic of a
nucleic acid cleaved by a type II restriction endonuclease. Alternatively, the cohesive ends
can be added by combining the methods based on type II restriction enzyme cutting and
chemical and/or enzymatic addition. It is also within the scope of the present invention for
the linearized nucleic acid to have one end that is a blunt end without unpaired nucleotides.
Such blunt ends can be generated by restriction endonuclease digestion, exonuclease
digestion of cohesive ends or fill-in of cohesive ends by polynucleotide synthesis, using
methods as described, for example, in Sambrook et al. (supra), incorporated herein by
reference in its entirety.

It is also to be understood that a nucleic acid to be delivered to a recipient cell may
be cleaved with two different restriction endonucleases that may generate the same or
different cohesive termini, or at least one blunt-end terminus. Neither restriction
endonuclease will have a recognition site within the nucleic acid sequence required to
function as a transgene in the recipient cell.

When a restriction endonuclease is used to cleave the genomic nucleic acid of the
recipient cell, the endonuclease may be co-delivered to the recipient cell such as a sperm
cell with the heterologous nucleic acid, or sequentially delivered. If a nucleic acid is
cleaved with at least two restriction endonucleases, thereby generating at least one cohesive
terminus, the at least two endonucleases may be delivered to a recipient cell either together
or sequentially. The transfected nucleic acid may be mixed with at least one of the
endonucleases or delivered to a recipient cell before or after at least one endonuclease is
delivered thereto.

At least one terminus of a linearized nucleic acid to be delivered to a recipient cell
may be a blunt end terminus, generated by endonuclease cleavage, chemical synthesis,
enzyme directed nucleic acid digestion or synthesis, or any combination thereof. A
recipient cell genome, such as a sperm cell genome, may therefore be cleaved before, during
or after delivery of the linearized nucleic acid to the cell, by delivery of a blunt-end
generating restriction endonuclease to the recipient cell, or by radiation-induced cleavage.
Suitable radiations that may be applied to, for example, a sperm cell include, but are not
limited to, gamma radiation, x-rays, ultraviolet light and ultrasound. The dose and duration
of the radiation applied to a cell sample are determined for each sample, for levels of
cleavage that will allow integration of the transfected nucleic acid into the cell genome,
while maintaining viability of the cells for use in artificial insemination or recolonization of
an avian testis. Viability of a recipient sperm may not be required when the transfected
sperm are delivered to a recipient avian oocyte by such procedures as ICSI or CHICSI™.
Cleavage of the genomic nucleic acid by irradiation or ultrasound can be either before,
during or after delivery of the heterologous nucleic acid to the recipient cell.

While not wishing to be bound by any one theory, the transfected nucleic acid may
be integrated into a cleavage site of the genomic nucleic acid. Integration may be facilitated
by the cohesive ends on the heterologous nucleic acid that hybridize to the like cohesive
ends of the cleaved genomic nucleic acid. The integrated heterologous nucleic acid will
then replicate and segregate with the genome of the recipient cell.
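
Whether two cohesive ends can hybridize comes down to base-pair complementarity of their single-stranded overhangs. The sketch below illustrates that check and is not taken from the specification: the function names are hypothetical, both overhangs are assumed to be written 5' to 3' and to have the same polarity (both 5' overhangs or both 3' overhangs), and the overhang sequences used in the example (AATT for EcoRI, GATC for BamHI and BglII) are the commonly cited ones.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def overhangs_can_anneal(overhang_a: str, overhang_b: str) -> bool:
    """True if two same-polarity overhangs are fully complementary.

    Two single-stranded ends pair antiparallel, so one overhang must equal
    the reverse complement of the other.
    """
    return overhang_a.upper() == reverse_complement(overhang_b)


print(overhangs_can_anneal("AATT", "AATT"))  # EcoRI end with EcoRI end
print(overhangs_can_anneal("GATC", "GATC"))  # BamHI end with BglII end
print(overhangs_can_anneal("AATT", "GATC"))  # EcoRI end with BamHI end
```

The second case shows why the text allows the vector and the genome to be cut with different enzymes, provided that the resulting termini are compatible.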

Alternatively, the heterologous nucleic acid may not be integrated into a recipient
genome, but will remain as an extrachromosomal episome. The heterologous nucleic acid
of the present invention may circularize by hybridization of the cohesive ends of the nucleic
acid, rather than be integrated into the genome. When the heterologous nucleic acid
comprises any natural or synthetic origin of replication (ori element) the nucleic acid will be
capable of replicating independently of the recipient genome. In one embodiment of the
present invention the ori site included with a heterologous nucleic acid is derived from the
SV40 virus. Episomal replication and segregation of daughter copies of the episome is
facilitated by the linearized viral ori site and/or a centromere isolated from, for example, a
chicken chromosome, thereby generating a chicken artificial chromosome. In another
embodiment, the linearized heterologous nucleic acid will not be integrated into the genome
of the recipient cell but remain as a separate unit that, because of a centromeric structure
incorporated therein, will segregate into daughter cells during mitotic division. In this case,
the unincorporated episomal heterologous nucleic acid is a chicken artificial chromosome
(CAC).

The REMI method for stably integrating heterologous DNA into the genomic DNA
of a recipient cell is described by Shemesh et al. in PCT Publication No. WO 99/42569 and
incorporated herein by reference in its entirety. This REMI method comprises in part an
adaptation of the REMI technique disclosed by Schiestl and Petes (Proc. Natl. Acad. Sci.
U.S.A. 88, 7585-7589 (1991)) and Kuspa and Loomis (Proc. Natl. Acad. Sci. U.S.A. 89,
8803-8807 (1992)), both incorporated herein by reference in their entireties.

In preferred embodiments, the avian sperm are irradiated before being exposed to
gene delivery mixture or having a transgene incorporated therein. The male germ cells can
be irradiated with a suitable dose of gamma irradiation, preferably 1 Gy, 2 Gy, 3 Gy, 4 Gy,
5 Gy, 6 Gy, 7 Gy, 8 Gy, 9 Gy, 10 Gy, 11 Gy, 12 Gy, 15 Gy or 20 Gy, without compromising
the viability and/or motility of the sperm. (See Wooster et al., 1977, Can. J. Genet. Cytol.
19: 437-446).

Whether employed in the in vivo, in situ or in vitro method, the gene delivery
mixture, once in contact with the male germ cells, facilitates the uptake and transport of
heterologous genetic material into the appropriate cell location for integration into the
genome and expression. A number of known gene delivery methods can be used to promote
the uptake of nucleic acid sequences into the cell and to facilitate the integration of the
heterologous nucleic acid into the genome of the recipient cell. Such methods include, but
are not limited to, viral vectors, liposomes, electroporation, REMI, and ICSI.

A gene delivery mixture suitable for use in the in vivo, in situ or in vitro methods of
sperm-mediated transfection comprises a nucleic acid encoding a desired trait or product,
and a suitable promoter sequence such as, for example, a tissue-specific promoter, or an
IRES. The transgenic nucleic acids of the present invention may further comprise an origin
of replication. For example, an origin of replication may be the SV40 ori, or a centromere
derived from the chicken. A linear nucleic acid may further comprise a telomere at one or
both ends of the nucleic acid.

Optionally, agents that increase the uptake of the nucleic acid sequence, or vectors that
comprise it (including vectors not derived from a eukaryotic virus, e.g., plasmids, BACs,
YACs, etc.), such as liposomes, retroviral vectors, adenoviral vectors, adenovirus enhanced
gene delivery systems, or
combinations thereof may be included in the gene delivery mixture. A reporter construct
including a genetic selection marker, such as the gene encoding Green Fluorescent
Protein, may also be added to the gene delivery mixture. Targeting molecules, such as c-kit
ligand, can be added to the gene delivery mixture to enhance the transfer of genetic material
into the male germ cell. An immunosuppressing agent such as cyclosporin or a
corticosteroid may also be added to the gene delivery mixture as known in the art.

Any of a number of commercially available gene delivery mixtures can be used, to
which the polynucleotide encoding a desired trait or product is further admixed. The final
gene delivery mixture comprising the polynucleotide can then be admixed with the male
gamete cells and allowed to interact for a period of between about 2 hours and about 16
hours, at a temperature of about 33°C to about 37°C. After this period, the cells are
preferably placed at a lower temperature of about 33°C to about 34°C, for about 4 hours to
about 20 hours, preferably about 16 to about 18 hrs.
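
The two-stage incubation described above can be captured as a simple range check on a planned protocol. This is an illustrative sketch only: the class and function names are hypothetical, and the "about" ranges from the text are treated as hard inclusive bounds, which is an assumption rather than something the specification states.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class IncubationStep:
    hours: float
    temp_c: float


# Ranges transcribed from the paragraph above; "about" is read as inclusive.
CONTACT_LIMITS = {"hours": (2.0, 16.0), "temp_c": (33.0, 37.0)}
HOLD_LIMITS = {"hours": (4.0, 20.0), "temp_c": (33.0, 34.0)}


def within_limits(step: IncubationStep, limits: dict) -> bool:
    """True if both duration and temperature fall inside the given ranges."""
    low_h, high_h = limits["hours"]
    low_t, high_t = limits["temp_c"]
    return low_h <= step.hours <= high_h and low_t <= step.temp_c <= high_t


def schedule_is_within_ranges(contact: IncubationStep,
                              hold: IncubationStep) -> bool:
    """Check the contact step and the subsequent lower-temperature step."""
    return (within_limits(contact, CONTACT_LIMITS)
            and within_limits(hold, HOLD_LIMITS))


# Hypothetical plan: 4 h of contact at 37 °C, then 17 h at 33.5 °C.
print(schedule_is_within_ranges(IncubationStep(4, 37.0),
                                IncubationStep(17, 33.5)))
```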

Isolating and/or selecting transgenic germ cells (and transgenic somatic
cells, and transgenic vertebrates) is by any suitable means, such as, but not limited to,
assessment of physiological and/or morphological phenotypes of interest using
biochemical, enzymatic, immunochemical, histologic, electrophysiologic, biometric or like
methods, and analysis of cellular nucleic acids, for example the presence or absence of
specific DNAs or RNAs of interest, using conventional molecular biological techniques,
including hybridization analysis, nucleic acid amplification including, but not limited to,
polymerase chain reaction, transcription-mediated amplification, reverse
transcriptase-mediated ligase chain reaction, and/or electrophoretic technologies.

One preferred method of isolating or selecting male germ cell populations comprises
obtaining specific male germ cell populations, such as spermatogonia, from a mixed
population of testicular cells by extrusion of the cells from the seminiferous tubules and
enzyme digestion. The spermatogonia, or other male germ cell populations, can be isolated
from a mixed cell population by methods such as the utilization of a promoter sequence that
is specifically or selectively active in cycling male germ line stem cell populations. Suitable
promoters include B-Myb or a specific promoter, such as the c-kit promoter region, c-raf-1
promoter, ATM (ataxia-telangiectasia) promoter, vasa promoter, RBM (ribosome binding
motif) promoter, DAZ (deleted in azoospermia) promoter, XRCC-1 promoter, HSP 90 (heat
shock gene) promoter, cyclin A1 promoter, or FRM (from Fragile X site) promoter and the
like. A selected promoter may be linked to a reporter construct, for example, a construct
comprising a gene encoding Green Fluorescent Protein (or EGFP), Yellow Fluorescent
Protein, Blue Fluorescent Protein, a phycobiliprotein, such as phycoerythrin or phycocyanin,
or any other protein which fluoresces under suitable wavelengths of light, or encoding a
light-emitting protein, such as luciferase or apoaequorin. The unique promoter sequences
drive the expression of the reporter construct only during specific stages of male germ cell
development (e.g., Mailer et al., 1999, J. Biol. Chem. 276(16), 11220-28; Schrans-Stassen
et al., 1999, Endocrinology 140, 5894-5900, both of which are incorporated herein by
reference in their entireties). In the case of a fluorescent reporter construct, the cells can be
sorted with the aid of, for example, a FACS set at the appropriate wavelength(s), or they can
be selected by chemical methods.

Male germ cells that have the DNA modified in the desired manner are isolated or
selected, and transferred to the testis of a suitable recipient avian, preferably the donor male
avian of the male germ cells. Further selection can be attempted after biopsy of one or both
of the recipient male's testes, or after examination of the animal's ejaculate amplified by the
polymerase chain reaction to confirm that the desired nucleic acid sequence had been
incorporated.



The genetically modified germ cells isolated or selected as described above are 
transferred to a testis of a suitable male avian, preferably a chicken, that can be, but need not 
be, the same donor animal. Before transferring the genetically modified male germ cells to 
the recipient animal, the testes of the recipient are depopulated of endogenous germ cells, 
thereby facilitating the colonization of the recipient testis by the genetically modified germ 
cells. Depopulation of the testis has commonly been accomplished by exposing the whole 
animal to gamma irradiation or by localized irradiation of the testis. The basic rigid 
architecture of the gonad should not be destroyed, nor significantly damaged. Disruption of 
tubules may lead to impaired transport of testicular sperm and result in infertility. Sertoli 
cells should not be irreversibly damaged, as they provide a base for development of the 
germ cells during maturation, and for preventing the host immune defense system from 
destroying grafted foreign spermatogonia. 

Suitable denuding methods include irradiation by gamma-rays, x-rays, ultrasound, or 
ultraviolet light; chemical treatment; infectious agents such as viruses; autoimmune 
depletion; or combinations thereof, preferably a combined treatment of 
the vertebrate with an alkylating agent and gamma irradiation as taught in WO 00/69257, 
incorporated herein by reference in its entirety. 

Gamma radiation-induced spermatogonial degeneration is probably related to the 
process of apoptosis (Hasegawa et al., 1998, Radiat. Res. 149:263-70). Alternatively, a 
composition containing an alkylating agent such as busulfan (MYLERAN™) can be used to 
depopulate the testis (Jiang FX, 1998, Anat. Embryol. 198(1): 53-61; Russell and Brinster, 1996, J. 
Androl. 17(6): 615-27; Boujrad et al., 1995, Andrologia 27(4): 223-28; Linder et al., 1992, 
Reprod. Toxicol. 6(6): 491-505; Kasuga and Takahashi, 1986, Endocrinol. Jpn. 33(1): 105-15). 
Other cytotoxic alkylating agents include, but are not limited to, chlorambucil, 
cyclophosphamide, melphalan, and ethyl ethanesulfonic acid; these may be combined with 
gamma irradiation, administered in either sequence. 
The dose of the alkylating agent and the dose of gamma radiation are in an amount 
sufficient to substantially depopulate the testis. The alkylating agent can be administered by 
any pharmaceutically acceptable delivery system, including but not limited to, 
intraperitoneal, intravenous, or intramuscular injection, intravenous drip, implant, 
transdermal or transmucosal delivery systems. 

The isolated or selected genetically modified germ cells are transferred into the 
recipient testis by direct injection using a suitable micropipette. Support cells, such as 
Leydig or Sertoli cells, that can be unmodified or genetically modified, can be transferred to 
a recipient testis along with the modified germ cells. 


5.1.13 DELIVERY OF TRANSGENIC SPERM TO OOCYTES 

The transfected male avian germ cells may be used to deliver a heterologous nucleic 
acid to an avian oocyte by implanting the transfected male germ cells, such as transfected 
spermatogonial precursor cells, into the testicular tissue of host male birds previously 
denuded of viable spermatogonial cells or sperm. The implanted transfected male avian 
germ cells may colonize the testicular tissue, proliferate therein, and generate viable 
transgenic sperm that may be harvested for use in artificial insemination procedures, or 
transferred to a recipient oocyte by natural coitus. 

In certain embodiments, therefore, the transgenic avian may be produced by the 
sperm-mediated transfer of at least one heterologous transgene. The transgene may be 
incorporated into the genomic nucleic acid of a spermatozoon cell or a precursor thereof, so 
that a genetically modified avian sperm is produced by the male avian. Breeding the male 
avian with a female of its species will generate a transgenic progeny carrying the at least one 
transgene in its genome. 

A union of male and female gametes to form a transgenic zygote is brought about by 
copulation of the male and female vertebrates of the same species, or by in vitro or in vivo 
artificial means. If artificial means are chosen, then incorporating into the genome a genetic 
selection marker that is expressed in male germ cells is particularly useful. 

Suitable artificial means include, but are not limited to, artificial insemination, in 
vitro fertilization (IVF) and/or other artificial reproductive technologies, such as 
intracytoplasmic sperm injection (ICSI), subzonal insemination (SUZI), or partial zona 
dissection (PZD). Others, such as cloning and embryo transfer, cloning and embryo 
splitting, and the like, can also be employed. 

In a preferred embodiment, a transgene is incorporated into an avian sperm by 
intracytoplasmic sperm injection (ICSI). The male germ cells, which may be intact and 
viable spermatozoa, or the non-viable heads thereof, may be microinjected into the 
cytoplasm or the nucleus of an isolated oocyte such as an avian oocyte, preferably a chicken 
oocyte, by any method known to one of skill in the art, including, for example, combining a 
confocal microscope and micromanipulator, or the like, to visualize and monitor the 
microinjection of an opaque avian egg. 

The transgenic vertebrate progeny can, in turn, be bred by natural mating, artificial 
insemination, or by in vitro fertilization (IVF) and/or other artificial reproductive 
technologies, such as intracytoplasmic sperm injection (ICSI) and chicken intracytoplasmic 
sperm injection (CHICSI™), subzonal insemination (SUZI), or partial zona dissection 
(PZD), to obtain further generations of transgenic progeny. Although the genetic material is 
originally inserted solely into the germ cells of a parent animal, it will ultimately be present 
in the germ cells of future progeny and subsequent generations thereof. In addition, the 
genetic material will also be present in cells of the progeny other than germ cells, i.e., 
somatic cells. 

The methods of the present invention may further comprise returning a transfected 
fertilized oocyte to a surrogate mother, especially a female chicken, for the continued 
incubation and development of the transgenic zygote. With chickens, the developed embryo 
is laid as a hard-shell egg that will hatch as a viable chick. When the heterologous nucleic 
acid is directly integrated into the genome of the oocyte, the transgenic chick will include 
the transgenic heterologous nucleic acid in all of its cells. Where the heterologous nucleic 
acid is episomal with respect to the genome of the transgenic zygote and chick, and the 
episomal nucleic acid comprises a centromeric body, most, if not all, of the cells of the 
zygote and chick will comprise the heterologous nucleic acid. When the episomal nucleic 
acid does not include a centromeric body, however, the transgenic zygote and chick can be a 
mosaic wherein expression of the exogenous transgene will only occur in some, but not all, 
cells or tissues of the transgenic animal. 

5.12 BREEDING AND MAINTENANCE OF TRANSGENIC AVIAN 
Another aspect of the present invention is a transgenic avian produced by the 
methods of the present invention and producing a heterologous polypeptide in an egg, 
wherein the transgenic avian comprises at least one heterologous nucleic acid sequence 
encoding the polypeptide and wherein the heterologous polypeptide is delivered to the white 
of an avian egg by a female of the avian. 

The invention relates to a method of producing transgenic avians that express 
significant quantities of useful heterologous proteins, e.g., therapeutic and diagnostic 
proteins, including immunoglobulins, industrially useful proteins and other biologics, etc., in 
the avian egg white. The heterologous protein can then be readily purified from the avian 
egg. The methods of the invention provide improved efficiencies of transgenesis, 
transmission of the transgene and/or level of heterologous protein expression. Another 
aspect of the invention is a method of producing a transgenic avian capable of expressing a 
heterologous protein. Therefore, the present invention relates to methods of producing 
transgenic avians, preferably chickens, wherein the incorporated transgene may be 
expressed as a constituent protein of the white of a hard-shell egg. 

Although the genetic material is originally inserted solely into the germ cells of a 
parent animal, it will ultimately be present in the germ cells of future progeny and 
subsequent generations thereof. In addition, the genetic material will also be present in cells 
of the progeny other than germ cells, i.e., somatic cells. 

Using the methods of the invention for producing transgenic avians, particularly 
methods using vectors that are not derived from eukaryotic viruses, and, preferably, the 
methods of cytoplasmic micro-injection described herein, the level of mosaicism of the 
transgene (percentage of cells containing the transgene) in avians hatched from 
microinjected embryos (i.e., the G0s) is greater than 5%, 10%, 25%, 50%, 75% or 90%, or is 
the equivalent of one copy per one genome, two genomes, five genomes, seven genomes or 
eight genomes, as determined by any number of techniques known in the art and described 
infra. 
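
As an illustration only, the two ways of stating mosaicism above (percentage of cells containing the transgene, and one copy per N genomes) can be related by a simple reciprocal. The sketch below assumes, purely for illustration, that every transgene-positive cell carries exactly one copy and that one "genome" corresponds to one cell; the function names are hypothetical and are not part of the described methods.

```python
def genomes_per_copy(fraction_positive: float) -> float:
    """Genome equivalents per transgene copy for a given fraction of positive cells.

    Assumption (illustrative): one transgene copy per positive cell.
    """
    if not 0.0 < fraction_positive <= 1.0:
        raise ValueError("fraction_positive must be in (0, 1]")
    return 1.0 / fraction_positive


def fraction_from_genomes(genomes: float) -> float:
    """Fraction of positive cells implied by 'one copy per N genomes'."""
    if genomes < 1.0:
        raise ValueError("genomes must be at least 1 under the one-copy assumption")
    return 1.0 / genomes


if __name__ == "__main__":
    # Percentages listed in the text, converted to genomes per copy.
    for pct in (5, 10, 25, 50, 75, 90):
        print(f"{pct}% of cells -> one copy per {genomes_per_copy(pct / 100):.2f} genomes")
    # Genome equivalents listed in the text, converted to percentage of cells.
    for n in (1, 2, 5, 7, 8):
        print(f"one copy per {n} genomes -> {100 * fraction_from_genomes(n):.1f}% of cells")
```

Under these assumptions the two lists in the text are not one-to-one equivalents of each other; the sketch only makes the arithmetic between them explicit.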

In additional particular embodiments, the percentage of G0s that transmit the 
transgene to progeny (G1s) is greater than 5%, preferably, greater than 10%, 20%, 30%, 
40%, and, most preferably, greater than 50%, 60%, 70%, 80%, 90%. In other embodiments, 
the transgene is detected in 10%, 20%, 30%, 40%, and most preferably, greater than 50%, 
60%, 70%, 80%, 90% of chicks hatching from embryos into which nucleic acids have been 
introduced using methods of the invention. 

5.2 VECTORS 

A variety of vectors useful in carrying out the methods of the present invention are 
described herein. These vectors may be used for stable introduction of a selected 
heterologous polypeptide-coding sequence (and/or regulatory sequences) into the genome of 
an avian, in particular, to generate transgenic avians that produce exogenous proteins in 
specific tissues of an avian, and in the oviduct in particular, or in the serum of an avian. In 
still further embodiments, the vectors are used in methods to produce avian eggs containing 
exogenous protein. 

In particular embodiments, preferably for use in the sperm-mediated transgenesis 
methods described herein, the vectors of the invention are not derived from eukaryotic viral 
vectors or retroviral vectors (except in certain embodiments for containing eukaryotic viral 
regulatory elements such as promoters, origins of replication, etc.). In particular 
embodiments, the vector is not an REV, ALV or MuLV vector. In particular, useful vectors 
include bacteriophages such as lambda derivatives, such as λgt11, λgt WES.λB, Charon 4, 
and plasmid vectors such as pBR322, pBR325, pACYC177, pACYC184, pUC8, pUC9, 
pUC18, pUC19, pLG339, pR290, pKC37, pKC101, SV40, pBLUESCRIPT® II SK +/- or KS 
+/- (see "Stratagene Cloning Systems" Catalog (1993) from STRATAGENE®, La Jolla, 
Calif., which is hereby incorporated by reference), pQE, pIH821, pGEX, pET series (see 
Studier, F.W. et al., 1990, "Use of T7 RNA Polymerase to Direct Expression of Cloned 
Genes" Gene Expression Technology 185, which is hereby incorporated by reference) and 
any derivatives thereof, cosmid vectors and, in preferred embodiments, artificial 
chromosomes, such as, but not limited to, YACs, BACs, BBPACs or PACs. Such artificial 
chromosomes are useful in that a large nucleic acid insert can be propagated and introduced 
into the avian cell. 

In other particular embodiments, as detailed above in section 5.2, infra, the vectors 
of the invention are derived from eukaryotic viruses, preferably avian viruses, and can be 
replication competent or, preferably, replication deficient. In particular embodiments, the 
vectors are derived from REV, ALV or MuLV. Nucleic acid sequences or derivative or 
truncated variants thereof, may be introduced into viruses such as vaccinia virus. Methods 
for making a viral recombinant vector useful for expressing a protein under the control of 
the lysozyme promoter are analogous to the methods disclosed in U.S. Patent Nos. 
4,603,112; 4,769,330; 5,174,993; 5,505,941; 5,338,683; 5,494,807; 4,722,848; Paoletti, E., 
1996, Proc. Natl. Acad. Sci. 93: 11349-11353; Moss, 1996, Proc. Natl. Acad. Sci. 93: 
11341-11348; Roizman, 1996, Proc. Natl. Acad. Sci. 93: 11307-11302; Frolov et al., 1996, 
Proc. Natl. Acad. Sci. 93: 11371-11377; Grunhaus et al., 1993, Seminars in Virology 3: 
237-252 and U.S. Patent Nos. 5,591,639; 5,589,466; and 5,580,859 relating to DNA 
expression vectors, inter alia; the contents of which are incorporated herein by reference in 
their entireties. 

Recombinant viruses can also be generated by transfection of plasmids into cells 
infected with virus. 

Preferably, vectors can replicate (i.e., have a bacterial origin of replication) and be 
manipulated in bacteria (or yeast) and can then be introduced into avian cells. Preferably, 
the vector comprises a marker that is selectable and/or detectable in bacteria or yeast cells 
and, preferably, also in avian cells; such markers include, but are not limited to, Amp^r, tet^r, 
LacZ, etc. Preferably, such vectors can accommodate (i.e., can be used to introduce into 
cells and replicate) large pieces of DNA such as genomic sequences, for example, large 
pieces of DNA consisting of at least 25 kb, 50 kb, 75 kb, 100 kb, 150 kb, 200 kb or 250 kb, 
such as BACs, YACs, cosmids, etc. 

The insertion of a DNA fragment into a vector can, for example, be accomplished by 
ligating the DNA fragment into a vector that has complementary cohesive termini. 
However, if the complementary restriction sites used to fragment the DNA are not present 
in the vector, the ends of the DNA molecules may be enzymatically modified. 
Alternatively, any site desired may be produced by ligating nucleotide sequences (linkers) 
onto the DNA termini; these ligated linkers may comprise specific chemically synthesized 
oligonucleotides encoding restriction endonuclease recognition sequences. In an alternative 
method, the cleaved vector and the transgene may be modified by homopolymeric tailing. 
The vector can be cloned using methods known in the art, e.g., by the methods 
disclosed in Sambrook et al., (supra); Ausubel et al., 1989, Current Protocols in Molecular 
Biology, Green Publishing Associates and Wiley Interscience, N.Y., both of which are 
hereby incorporated by reference in their entireties. Preferably, the vectors contain cloning 
sites, for example, restriction enzyme sites that are unique in the sequence of the vector and 
at which insertion of a sequence would not disrupt an essential vector function, such as 
replication. 
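
The preference for cloning sites that are unique in the vector sequence can be checked computationally. The following is a minimal sketch, not a method taken from this disclosure: it counts occurrences of a few well-known palindromic recognition sequences (EcoRI, BamHI, HindIII) in a vector sequence and reports the enzymes that cut exactly once. It treats the sequence as linear, so a site spanning the junction of a circular plasmid would be missed, and the example sequence is invented for illustration.

```python
# Recognition sequences for three common restriction enzymes (all palindromic,
# so scanning one strand is sufficient).
RECOGNITION_SITES = {
    "EcoRI": "GAATTC",
    "BamHI": "GGATCC",
    "HindIII": "AAGCTT",
}


def count_sites(sequence: str, site: str) -> int:
    """Count (possibly overlapping) occurrences of site in sequence."""
    sequence = sequence.upper()
    count = 0
    start = 0
    while True:
        index = sequence.find(site, start)
        if index == -1:
            return count
        count += 1
        start = index + 1


def unique_cutters(vector_sequence: str, sites=RECOGNITION_SITES) -> list:
    """Return the names of enzymes whose site occurs exactly once in the vector."""
    return sorted(
        name for name, site in sites.items()
        if count_sites(vector_sequence, site) == 1
    )


if __name__ == "__main__":
    # Hypothetical toy sequence: one EcoRI site, two BamHI sites, no HindIII site.
    toy_vector = "AAAGAATTCTTTGGATCCAAAGGATCCTTT"
    print(unique_cutters(toy_vector))
```

Whether insertion at a unique site would disrupt an essential vector function still has to be judged from the vector map; the sketch addresses uniqueness only.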

As discussed above, vectors used in certain methods of the invention preferably can 
accommodate, and in certain embodiments comprise, large pieces of heterologous DNA 
such as genomic sequences, particularly avian genomic sequences. Such vectors can 
contain an entire genomic locus, or at least sufficient sequence to confer endogenous 
regulatory expression pattern, e.g., high level of expression in the magnum characteristic of 
ovalbumin, lysozyme, ovomucoid, ovotransferrin, conalbumin, and ovomucin, etc., and to 
insulate the expression of the transgene sequences from the effect of regulatory sequences 
surrounding the site of integration of the transgene in the genome. Accordingly, as detailed 
below, in preferred embodiments, the transgene is inserted in an entire genomic locus or 
significant portion thereof. 

To manipulate large genomic sequences contained in, for example, a BAC, 
nucleotide sequences coding for the heterologous protein to be expressed and/or other 
regulatory elements may be inserted into the BAC by directed homologous recombination in 
bacteria, e.g., the methods of Heintz WO 98/59060; Heintz et al., WO 01/05962; Yang et 
al., 1997, Nature Biotechnol. 15: 859-865; Yang et al., 1999, Nature Genetics 22: 327-35; 
which are incorporated herein by reference in their entireties. 

Alternatively, the BAC can also be engineered or modified by "E-T cloning," as 
described by Muyrers et al. (1999, Nucleic Acids Res. 27(6): 1555-57, incorporated herein 
by reference in its entirety). Using these methods, specific DNA may be engineered into a 
BAC independently of the presence of suitable restriction sites. This method is based on 
homologous recombination mediated by the recE and recT proteins ("ET-cloning") (Zhang 
et al., 1998, Nat. Genet. 20(2): 123-28; incorporated herein by reference in its entirety). 
Homologous recombination can be performed between a PCR fragment flanked by short 
homology arms and an endogenous intact recipient such as a BAC. Using this method, 
homologous recombination is not limited by the disposition of restriction endonuclease 
cleavage sites or the size of the target DNA. A BAC can be modified in its host strain using 
a plasmid, e.g., pBAD-αβγ, in which recE and recT have been replaced by their respective 
functional counterparts of phage lambda (Muyrers et al., 1999, Nucleic Acids Res. 27(6): 
1555-57). Preferably, a BAC is modified by recombination with a PCR product containing 
homology arms ranging from 27-60 bp. In a specific embodiment, homology arms are 50 
bp in length. 
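
To make the role of the short homology arms concrete, the sketch below assembles a pair of PCR primers in which the 5' portion of each primer is a homology arm copied from the target (e.g., a BAC) on either side of the intended insertion point, and the 3' portion anneals to the cassette being amplified. This is an illustrative sketch only: the function names, the 20-nucleotide priming length, and the toy sequences are assumptions, and only the 27-60 bp arm range and the 50 bp default come from the text.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(sequence: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return "".join(COMPLEMENT[base] for base in reversed(sequence.upper()))


def design_recombination_primers(target: str, insert_pos: int, cassette: str,
                                 arm_len: int = 50, prime_len: int = 20):
    """Build forward and reverse primers carrying homology arms.

    target     -- sequence of the recipient (e.g., a region of a BAC)
    insert_pos -- 0-based index in target where the cassette is to be inserted
    cassette   -- sequence to be amplified and inserted
    arm_len    -- homology arm length (27-60 nt per the text; 50 by default)
    prime_len  -- length of the cassette-annealing part (assumed value)
    """
    if not 27 <= arm_len <= 60:
        raise ValueError("arm_len outside the 27-60 nt range given in the text")
    if insert_pos < arm_len or insert_pos + arm_len > len(target):
        raise ValueError("not enough flanking target sequence for the arms")
    if len(cassette) < prime_len:
        raise ValueError("cassette shorter than the priming region")

    target = target.upper()
    cassette = cassette.upper()
    left_arm = target[insert_pos - arm_len:insert_pos]
    right_arm = target[insert_pos:insert_pos + arm_len]

    # Forward primer: left homology arm followed by the start of the cassette.
    forward = left_arm + cassette[:prime_len]
    # Reverse primer: reverse complement of (end of cassette + right homology arm),
    # so its 5' end is the arm and its 3' end anneals to the cassette.
    reverse = reverse_complement(cassette[-prime_len:] + right_arm)
    return forward, reverse


if __name__ == "__main__":
    toy_target = "ACGT" * 40                       # hypothetical 160-nt target
    toy_cassette = "G" * 10 + "A" * 30 + "C" * 10  # hypothetical 50-nt cassette
    fwd, rev = design_recombination_primers(toy_target, 80, toy_cassette)
    print(len(fwd), len(rev))
```

The PCR product generated with such primers carries both arms at its ends, which is what allows recombination independent of restriction sites; real primer design would additionally consider melting temperature and secondary structure, which this sketch ignores.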

In another embodiment, a transgene is inserted into a yeast artificial chromosome 
(YAC) (Burke et al., 1987, Science 236: 806-12; and Peterson et al., 1997, Trends Genet. 
13:61, both of which are incorporated by reference herein in their entireties). 

In other embodiments, the transgene is inserted into another vector developed for the 
cloning of large segments of genomic DNA, such as a cosmid or bacteriophage P1 
(Sternberg et al., 1990, Proc. Natl. Acad. Sci. USA 87: 103-07). The approximate 
maximum insert size is 30-35 kb for cosmids and 100 kb for bacteriophage P1. In another 
embodiment, the transgene is inserted into a P1-derived artificial chromosome (PAC) 
(Mejia et al., 1997, Genome Res. 7:179-186). The maximum insert size is 300 kb. 
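
The insert-size limits just stated can be used to shortlist candidate vectors for a given insert. The sketch below is illustrative only: it encodes the approximate maxima given in the text (using the upper figure of 35 kb for cosmids), and the function name is hypothetical. Limits for YACs and BACs are not stated in this passage and are therefore not included.

```python
# Approximate maximum insert sizes in kb, as stated in the text.
MAX_INSERT_KB = {
    "cosmid": 35,             # text gives 30-35 kb; the upper bound is used here
    "bacteriophage P1": 100,
    "PAC": 300,
}


def vectors_for_insert(insert_kb: float) -> list:
    """Return the vector types whose stated maximum can hold the insert."""
    if insert_kb <= 0:
        raise ValueError("insert size must be positive")
    return [name for name, limit in MAX_INSERT_KB.items() if insert_kb <= limit]


if __name__ == "__main__":
    for size in (30, 80, 250, 350):
        print(size, "kb ->", vectors_for_insert(size))
```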

Vectors containing the appropriate heterologous sequences may be identified by any 
method well known in the art, for example, by sequencing, restriction mapping, 
hybridization, PCR amplification, etc. 

The vectors of the invention comprise one or more nucleotide sequences encoding a 
heterologous protein desired to be expressed in the transgenic avian, as well as regulatory 
elements such as promoters, enhancers, MARs, IRES's and other translation control 
elements, transcriptional termination elements, polyadenylation sequences, etc., as discussed 
infra. In particular embodiments, the vector of the invention contains at least two 
nucleotide sequences coding for heterologous proteins, for example, but not limited to, the 
heavy and light chains of an immunoglobulin. 

In a preferred embodiment, the nucleotide sequence encoding the heterologous 
protein is inserted into all or a significant portion of a nucleic acid containing the genomic 
sequence of an endogenous avian gene, preferably an avian gene that is expressed in the 
magnum, e.g., ovalbumin, lysozyme, ovomucoid, ovotransferrin, conalbumin, and 
ovomucin, etc. For example, the heterologous gene sequence may be inserted into or 
replace a portion of the 3' untranslated region (UTR) or 5' untranslated region (UTR) or an 
intron sequence of the endogenous gene genomic sequence. Preferably, the heterologous 
gene coding sequence has its own IRES. For descriptions of IRES's, see, e.g., Jackson et 
al., 1990, Trends Biochem. Sci. 15(12):477-83; Jang et al., 1988, J. Virol. 62(8):2636-43; 
Jang et al., 1990, Enzyme 44(1-4):292-309; and Martinez-Salas, 1999, Curr. Opin. 
Biotechnol. 10(5):458-64; Palmenberg et al., United States Patent No. 4,937,190, which are 
incorporated by reference herein in their entireties. In another embodiment, the 
heterologous protein coding sequence is inserted at the 3' end of the endogenous gene 
coding sequence. In another preferred embodiment, the heterologous gene coding 
sequences are inserted using 5' direct fusion wherein the heterologous gene coding 
sequences are inserted in-frame adjacent to the initial ATG sequence (or adjacent the 
nucleotide sequence encoding the first two, three, four, five, six, seven or eight amino acids) 
of the endogenous gene or replacing some or all of the sequence of the endogenous gene 
coding sequence. In yet another specific embodiment, the heterologous gene coding 
sequence is inserted into a separate cistron in the 5' region of the endogenous gene genomic 
sequence and has an independent IRES sequence. 

The present invention further relates to nucleic acid vectors (preferably, not derived 
from eukaryotic viruses, except, in certain embodiments, for eukaryotic viral promoters and/or 
enhancers) and transgenes inserted therein that incorporate multiple polypeptide-encoding 
regions, wherein a first polypeptide-encoding region is operatively linked to a 
transcription promoter and a second polypeptide-encoding region is operatively linked to an 
IRES. For example, the vector may contain coding sequences for two different 
heterologous proteins (e.g., the heavy and light chains of an immunoglobulin) or the coding 
sequences for all or a significant part of the genomic sequence for the gene from which the 
promoter driving expression of the transgene is derived, and the heterologous protein 
desired to be expressed (e.g., a construct containing the genomic coding sequences, 
including introns, of the avian lysozyme gene when the avian lysozyme promoter is used to 
drive expression of the transgene, an IRES, and the coding sequence for the heterologous 
protein desired to be expressed downstream (i.e., 3' on the RNA transcript) of the IRES). 
Thus in certain embodiments, the nucleic acid encoding the heterologous protein is 
introduced into the 5' untranslated or 3' untranslated regions of an endogenous gene, such as 
but not limited to, ovalbumin, lysozyme, ovomucoid, ovotransferrin, conalbumin, and 
ovomucin, with an IRES sequence directing translation of the heterologous sequence. 

Such nucleic acid constructs, when inserted into the genome of a bird and expressed 
therein, will generate individual polypeptides that may be post-translationally modified, for 
example, glycosylated or, in certain embodiments, form complexes, such as heterodimers 
with each other in the white of the avian egg. Alternatively, the expressed polypeptides may 
be isolated from an avian egg and combined in vitro, or expressed in a non-reproductive 
tissue such as serum. In other embodiments, for example, but not limited to, when 
expression of both heavy and light chains of an antibody is desired, two separate constructs, 
each containing a coding sequence for one of the heterologous proteins operably linked to a 
promoter (either the same or different promoters), are introduced by microinjection into 
cytoplasm of one or more embryonic cells and transgenic avians harboring both transgenes 
in their genomes and expressing both heterologous proteins are identified. Alternatively, 
two transgenic avians each containing one of the two heterologous proteins (e.g., one 
transgenic avian having a transgene encoding the light chain of an antibody and a second 
transgenic avian having a transgene encoding the heavy chain of the antibody) can be bred 
to obtain an avian containing both transgenes in its germline and expressing both transgene 
encoded proteins, preferably in eggs. 

Recombinant expression vectors can be designed for the expression of the encoded 
proteins in eukaryotic cells. Useful vectors may comprise constitutive or inducible 
promoters to direct expression of either fusion or non-fusion proteins. With fusion vectors, 
a number of amino acids are usually added to the expressed target gene sequence such as, 
but not limited to, a protein sequence for thioredoxin, a polyhistidine, or any other amino 
acid sequence that facilitates purification of the expressed protein. A proteolytic cleavage 
site may further be introduced at a site between the target recombinant protein and the 
fusion sequence. Additionally, a region of amino acids such as a polymeric histidine region 
may be introduced to allow binding of the fusion protein to metallic ions such as nickel 
bonded to a solid support, and thereby allow purification of the fusion protein. Once the 
fusion protein has been purified, the cleavage site allows the target recombinant protein to 
be separated from the fusion sequence. Enzymes suitable for use in cleaving the proteolytic 
cleavage site include, but are not limited to, Factor Xa and thrombin. Fusion expression 
vectors that may be useful in the present invention include pGex (AMRAD® Corp., 
Melbourne, Australia), pRIT5 (PHARMACIA®, Piscataway, NJ) and pMAL (NEW 
ENGLAND BIOLABS®, Beverly, MA), fusing glutathione S-transferase, protein A, or 
maltose E binding protein, respectively, to the target recombinant protein. 

Once a promoter and a nucleic acid encoding a heterologous protein of the present 
invention have been cloned into a vector system, it is ready to be incorporated into a host 
cell. Such incorporation can be carried out by the various forms of transformation noted 
above, depending upon the vector/host cell system. It is contemplated that the incorporation 
of the DNA of the present invention into a recipient cell may be by any suitable method 
such as, but not limited to, viral transfer, electroporation, gene gun insertion, sperm-mediated 
transfer to an ovum, microinjection and the like. Suitable host cells include, but 
are not limited to, bacteria, virus, yeast, mammalian cells, and the like. In particular, the 
present invention contemplates the use of recipient avian cells, such as chicken cells or 
quail cells. 

Another aspect of the present invention, therefore, is a method of expressing a 
heterologous polypeptide in a eukaryotic cell by transfecting an avian cell with a 
recombinant DNA comprising an avian tissue-specific promoter operably linked to a nucleic 
acid insert encoding a polypeptide and, optionally, a polyadenylation signal sequence, and 
culturing the transfected cell in a medium suitable for expression of the heterologous 
polypeptide under the control of the avian lysozyme gene expression control region. 

Yet another aspect of the present invention is a eukaryotic cell transformed with an 
expression vector according to the present invention and described above. In one 
embodiment of the present invention, the transformed cell is a chicken oviduct cell and the 
nucleic acid insert comprises the chicken lysozyme gene expression control region, a 
nucleic acid insert encoding a human interferon α2b and codon optimized for expression in 
an avian cell, and an SV40 polyadenylation sequence. 

In another embodiment, the transformed cell is a quail oviduct cell and the nucleic 
acid insert comprises the artificial avian promoter construct MDOT (SEQ ID NO: 11) 
operably linked to an interferon-encoding sequence, as described in Example 23 below. 

In yet another embodiment of the present invention, a quail oviduct cell is 
transfected with the nucleic acid insert comprising the MDOT artificial promoter construct 
operably linked to an erythropoietin (EPO)-encoding nucleic acid, wherein the transfected 
quail produces heterologous erythropoietin. 

5.2.1 PROMOTERS 

The vectors of the invention contain promoters that function in avian cells, 
preferably, that are tissue-specific and, in preferred embodiments, direct expression in the 
magnum or serum or other tissue such that expressed proteins are deposited in eggs, more 
preferably, that are specific for expression in the magnum. Alternatively, the promoter 
directs expression of the protein in the serum of the transgenic avian. Introduction of the 
vectors of the invention preferably generates transgenics that express the heterologous 
protein in tubular gland cells where it is secreted into the oviduct lumen and deposited, e.g., 
into the white of an egg. In preferred embodiments, the promoter directs a level of 
expression of the heterologous protein in the egg white of eggs laid by G0 and/or G1 chicks 
and/or their progeny that is greater than 5 µg, 10 µg, 50 µg, 100 µg, 250 µg, 500 µg or 750 
µg, more preferably greater than 1 mg, 2 mg, 5 mg, 10 mg, 20 mg, 50 mg, 100 mg, 200 mg, 
500 mg, 700 mg, 1 gram, 2 grams, 3 grams, 4 grams or 5 grams. Such levels of expression 
can be obtained using the promoters of the invention. 

In preferred embodiments, the promoters of the invention are derived from genes 
that express proteins present in significant levels in the egg white and/or the serum. For 
example, the promoter comprises regions of an ovalbumin, lysozyme, ovomucoid, 
ovotransferrin, conalbumin or ovomucin promoter or any other promoter that directs 
expression of a gene in an avian, particularly in a specific tissue of interest, such as the 
magnum or in the serum. Alternatively, the promoter used in the expression vector may be 
derived from that of the lysozyme gene that is expressed in both the oviduct and 
macrophages. Portions of two or more of these, and other promoters that function in avians, 
may be combined to produce an effective synthetic promoter. 

The promoter may optionally be a segment of the ovalbumin promoter region that is 
sufficiently large to direct expression of the coding sequence in the tubular gland cells. 
Other exemplary promoters include the promoter regions of the ovalbumin, lysozyme, 
ovomucoid, conalbumin, ovotransferrin or ovomucin genes (for example, but not limited to, 
as disclosed in co-pending United States Patent Application Nos. 09/922,549, filed August 
3, 2001 and 10/114,739, filed April 1, 2002, both entitled "Avian Lysozyme Promoter", by 
Rapp, and United States Patent Application No. 09/998,716, filed November 30, 2001, 
entitled "Ovomucoid Promoter and Methods of Use," by Harvey et al., all of which are 
incorporated by reference herein in their entireties). Alternatively, the promoter may be a 
promoter that is largely, but not entirely, specific to the magnum, such as the lysozyme 
promoter. Other suitable promoters may be artificial constructs such as a combination of 
nucleic acid regions derived from at least two avian gene promoters. One such embodiment 
of the present invention is the MDOT construct (SEQ ID NO: 11) comprising regions 
derived from the chicken ovomucin and ovotransferrin promoters, including but not limited 
to promoters altered, e.g., to increase expression, and inducible promoters, e.g., the tet 
system. 

The ovalbumin gene encodes a 45 kD protein that is also specifically expressed in 
the tubular gland cells of the magnum of the oviduct (Beato, 1989, Cell 56:335-344). 
Ovalbumin is the most abundant egg white protein, comprising over 50 percent of the total 
protein produced by the tubular gland cells, or about 4 grams of protein per large Grade A 
egg (Gilbert, "Egg albumen and its formation" in Physiology and Biochemistry of the 
Domestic Fowl, Bell and Freeman, eds., Academic Press, London, New York, pp. 1291-1329). 
The ovalbumin gene and over 20 kb of each flanking region have been cloned and 
analyzed (Lai et al., 1978, Proc. Natl. Acad. Sci. USA 75:2205-2209; Gannon et al., 1979, 
Nature 278:428-424; Roop et al., 1980, Cell 19:63-68; and Royal et al., 1975, Nature 
279:125-132). 

The ovalbumin gene responds to steroid hormones such as estrogen, glucocorticoids,
and progesterone, which induce the accumulation of about 70,000 ovalbumin mRNA
transcripts per tubular gland cell in immature chicks and 100,000 ovalbumin mRNA
transcripts per tubular gland cell in the mature laying hen (Palmiter, 1973, J. Biol. Chem.
248:8260-8270; Palmiter, 1975, Cell 4:189-197). The 5' flanking region contains four
DNase I-hypersensitive sites centered at -0.25, -0.8, -3.2, and -6.0 kb from the transcription
start site. These sites are called HS-I, -II, -III, and -IV, respectively. Promoters of the
invention may contain one, all, or a combination of HS-I, HS-II, HS-III and HS-IV.
Hypersensitivity of HS-II and -III is estrogen-induced, supporting a role for these regions
in hormone induction of ovalbumin gene expression.

HS-I and HS-II are both required for steroid induction of ovalbumin gene
transcription, and a 1.4 kb portion of the 5' region that includes these elements is sufficient
to drive steroid-dependent ovalbumin expression in explanted tubular gland cells (Sanders
and McKnight, 1988, Biochemistry 27: 6550-6557). HS-I is termed the negative-response
element ("NRE") because it contains several negative regulatory elements which repress
ovalbumin expression in the absence of hormone (Haekers et al., 1995, Mol. Endo. 9:1113-
1126). Protein factors bind these elements, including some factors found only in oviduct
nuclei, suggesting a role in tissue-specific expression. HS-II is termed the steroid-dependent
response element ("SDRE") because it is required to promote steroid induction of
transcription. It binds a protein or protein complex known as Chirp-I. Chirp-I is induced by
estrogen and turns over rapidly in the presence of cycloheximide (Dean et al., 1996, Mol.
Cell. Biol. 16:2015-2024). Experiments using an explanted tubular gland cell culture
system defined an additional set of factors that bind SDRE in a steroid-dependent manner,
including a NFκB-like factor (Nordstrom et al., 1993, J. Biol. Chem. 268:13193-13202;
Schweers and Sanders, 1991, J. Biol. Chem. 266: 10490-10497).

Less is known about the function of HS-III and HS-IV. HS-III contains a functional
estrogen response element, and confers estrogen inducibility to either the ovalbumin
proximal promoter or a heterologous promoter when co-transfected into HeLa cells with an
estrogen receptor cDNA. These data imply that HS-III may play a functional role in the
overall regulation of the ovalbumin gene. Little is known about the function of HS-IV,
except that it does not contain a functional estrogen-response element (Kato et al., 1992,
Cell 68: 731-742).

In an alternative embodiment of the invention, transgenes containing constitutive
promoters are used, but the transgenes are engineered so that expression of the transgene
effectively becomes magnum-specific. Thus, a method for producing an exogenous protein
in an avian oviduct provided by the present invention involves generating a transgenic avian
having two transgenes in its tubular gland cells. One transgene comprises a first coding
sequence operably linked to a constitutive promoter. The second transgene comprises a
second coding sequence that is operably linked to a magnum-specific promoter, where
expression of the first coding sequence is either directly or indirectly dependent upon the
cellular presence of the protein expressed by the second coding sequence.

Additional promoters useful in the present invention include inducible promoters,
such as the tet operator and the metallothionein promoter, which can be induced by
treatment with tetracycline and zinc ions, respectively (Gossen et al., 1992, Proc. Natl.
Acad. Sci. 89: 5547-5551 and Walden et al., 1987, Gene 61: 317-327; incorporated herein
by reference in their entireties).

5.2.1.1 CHICKEN LYSOZYME GENE EXPRESSION CONTROL REGION NUCLEIC ACID SEQUENCES

The chicken lysozyme gene is highly expressed in the myeloid lineage of
hematopoietic cells, and in the tubular glands of the mature hen oviduct (Hauser et al.,
1981, Hematol. and Blood Transfusion 26: 175-178; Schutz et al., 1978, Cold Spring
Harbor Symp. Quart. Biol. 42: 617-624) and is therefore a suitable candidate for an efficient
promoter for heterologous protein production in transgenic animals. The regulatory region
of the lysozyme locus extends over at least 12 kb of DNA 5' upstream of the transcription
start site, and comprises a number of elements that have been individually isolated and
characterized. The known elements include three enhancer sequences at about -6.1 kb, -3.9
kb, and -2.7 kb (Grewal et al., 1992, Mol. Cell Biol. 12: 2339-2350; Bonifer et al., 1996, J.
Mol. Med. 74: 663-671), a hormone responsive element (Hecht et al., 1988, E.M.B.O.J. 7:
2063-2073), a silencer element and a complex proximal promoter. The constituent
elements of the lysozyme gene expression control region are identifiable as DNase I
hypersensitive chromatin sites (DHS). They may be differentially exposed to nuclease
digestion depending upon the differentiation stage of the cell. For example, in the
multipotent progenitor stage of myelomonocytic cell development, or in erythroblasts, the
silencer element is a DHS. At the myeloblast stage, a transcription enhancer located -6.1
kb upstream from the gene transcription start site is a DHS, while at the later monocytic
stage another enhancer, at -2.7 kb, becomes DNase sensitive (Huber et al., 1995, DNA and
Cell Biol. 14:397-402).

This invention also envisions the use of promoters other than the lysozyme
promoter, including but not limited to, a cytomegalovirus promoter, an ovomucoid,
conalbumin or ovotransferrin promoter or any other promoter that directs expression of a
gene in an avian, particularly in a specific tissue of interest, such as the magnum.

Another aspect of the methods of the present invention is the use of combinational
promoters comprising an artificial nucleic acid construct having at least two regions
wherein the regions are derived from at least two gene promoters, including but not limited
to a lysozyme, ovomucoid, conalbumin or ovotransferrin promoter. In one embodiment of
the present invention, the promoter may comprise a region of an avian ovomucoid promoter
and a region of an avian ovotransferrin promoter, thereby generating the MDOT avian
artificial promoter construct. The avian MDOT promoter construct of the present invention
has the nucleic acid sequence SEQ ID NO: 11 and is illustrated in Figure 7. This promoter
is useful for allowing expression of a heterologous protein in chicken oviduct cells and may
be operably linked to any nucleic acid encoding a heterologous polypeptide of interest
including, for example, a cytokine, growth hormone, growth factor, enzyme, structural
protein or the like.

5.2.2 MATRIX ATTACHMENT REGIONS

In preferred embodiments of the invention, the vectors contain matrix attachment
regions (MARs) that preferably flank the transgene sequences to reduce position effects on
expression when integrated into the avian genome. In fact, 5' MARs and 3' MARs (also
referred to as "scaffold attachment regions" or SARs) have been identified in the outer
boundaries of the chicken lysozyme locus (Phi-Van et al., 1988, E.M.B.O.J. 7: 655-664;
Phi-Van, L. and Stratling, W.H., 1996, Biochem. 35: 10735-10742). Deletion of a 1.32 kb
or a 1.45 kb region, each comprising half of a 5' MAR, reduces positional variation
in the level of transgene expression (Phi-Van and Stratling, supra).

The 5' matrix-associated region (5' MAR), located about -11.7 kb upstream of the
chicken lysozyme transcription start site, can increase the level of gene expression by
limiting the positional effects exerted against a transgene (Phi-Van et al., 1988, supra). At
least one other MAR is located 3' downstream of the protein encoding region. Although
MAR nucleic acid sequences are conserved, little cross-hybridization is seen, indicating
significant overall sequence variation. However, MARs of different species can interact
with the nucleomatrices of heterologous species, to the extent that the chicken lysozyme
MAR can associate with the plant tobacco nucleomatrix as well as that of the chicken
oviduct cells (Mlynarova et al., 1994, Cell 6: 417-426; von Kries et al., 1990, Nucleic Acids
18: 3881-3885).

Gene expression must be considered not only from the perspective of cis-regulatory
elements associated with a gene, and their interactions with trans-acting elements, but also
with regard to the genetic environment in which they are located. Chromosomal positioning
effects (CPEs), therefore, are the variations in levels of transgene expression associated with
different locations of the transgene within the recipient genome. An important factor
governing CPE upon the level of transgene expression is the chromatin structure around a
transgene, and how it cooperates with the cis-regulatory elements. The cis-elements of the
lysozyme locus are confined within a single chromatin domain (Bonifer et al., 1996, supra;
Sippel et al., pgs. 133-147 in Eckstein F. & Lilley D.M.J. (eds), "Nucleic Acids and
Molecular Biology", Vol. 3, 1989, Springer).

The lysozyme promoter region of chicken is active when transfected into mouse
fibroblast cells and linked to a reporter gene such as the bacterial chloramphenicol
acetyltransferase (CAT) gene. The promoter element is also effective when transiently
transfected into chicken promacrophage cells. In each case, however, the presence of a 5'
MAR element increased positional independency of the level of transcription (Stief et al.,
1989, Nature 341: 343-345; Sippel et al., pgs. 257-265 in Houdebine L.M. (ed),
"Transgenic Animals: Generation and Use").

The ability to direct the insertion of a transgene into a site in the genome of an
animal where the positional effect is limited offers predictability of results during the
development of a desired transgenic animal and increased yields of the expressed product.
Sippel and Stief disclose, in U.S. Patent No. 5,731,178, which is incorporated by reference
herein in its entirety, methods to increase the expression of genes introduced into eukaryotic
cells by flanking a transcription unit with scaffold attachment elements, in particular the 5'
MAR isolated from the chicken lysozyme gene. The transcription unit disclosed by Sippel
and Stief was an artificial construct that combined only the -6.1 kb enhancer element and
the proximal promoter element (base position -579 to +15) from the lysozyme gene. Other
promoter-associated elements were not included. However, although individual
cis-regulatory elements have been isolated and sequenced, together with short regions of
flanking DNA, the entire nucleic acid sequence comprising the functional 5' upstream region
of the lysozyme gene has not been determined in its entirety and has therefore not been
employed as a functional promoter to allow expression of a heterologous transgene.

Accordingly, vectors of the invention comprise MARs, preferably both 5' and 3'
MARs that flank the transgene, including the heterologous protein coding sequences and the
regulatory sequences.

5.2.3 CODON-OPTIMIZED GENE EXPRESSION

Another aspect of the present invention provides nucleic acid sequences encoding
heterologous polypeptides that are codon-optimized for expression in avian cells, and
derivatives and fragments thereof. When a heterologous nucleic acid is to be delivered to a
recipient cell for expression therein, the sequence of the nucleic acid may be
modified so that the codons are optimized for the codon usage of the recipient species. For
example, if the heterologous nucleic acid is transfected into a recipient chicken cell, the
sequence of the expressed nucleic acid insert is optimized for chicken codon usage. This
may be determined from the codon usage of at least one, and preferably more than one,
protein expressed in a chicken cell. For example, the codon usage may be determined from
the nucleic acid sequences encoding the proteins ovalbumin, lysozyme, ovomucoid,
ovotransferrin, conalbumin, and ovomucin of chicken. Briefly, the DNA sequence for the
target protein may be optimized using the BACKTRANSLATE® program of the Wisconsin
Package, version 9.1 (Genetics Computer Group, Inc., Madison, WI) with a codon usage
table compiled from the chicken (Gallus gallus) ovalbumin, lysozyme, ovomucoid,
ovotransferrin, conalbumin, and ovomucin proteins. The template and primer
oligonucleotides are then amplified, by any means known in the art, including but not
limited to PCR with Pfu polymerase (STRATAGENE®, La Jolla CA).
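
The codon-optimization approach described above can be illustrated with a minimal Python sketch. It is not the BACKTRANSLATE® program and does not reproduce its algorithm; it only assumes the simplest possible rule, namely that each amino acid is encoded by the codon seen most often in a set of reference coding sequences. The function names (codon_usage, back_translate, translate) and the toy reference sequence are hypothetical and are not taken from the chicken genes named in the text.

```python
from collections import Counter, defaultdict

# Standard genetic code, built from the conventional T, C, A, G codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def codon_usage(coding_seqs):
    """Count, per amino acid, how often each codon occurs in the reference sequences."""
    counts = defaultdict(Counter)
    for seq in coding_seqs:
        seq = seq.upper()
        # Read in frame from position 0 and ignore any incomplete trailing codon.
        for i in range(0, len(seq) - len(seq) % 3, 3):
            codon = seq[i:i + 3]
            amino_acid = CODON_TABLE.get(codon)
            if amino_acid is not None:
                counts[amino_acid][codon] += 1
    return counts


def back_translate(protein, usage):
    """Encode a protein using the most frequent reference codon for each residue."""
    codons = []
    for amino_acid in protein.upper():
        if amino_acid in usage:
            codons.append(usage[amino_acid].most_common(1)[0][0])
        else:
            # Residue never seen in the reference set: fall back to any valid codon.
            codons.append(next(c for c, a in CODON_TABLE.items() if a == amino_acid))
    return "".join(codons)


def translate(dna):
    """Translate an in-frame DNA sequence back to protein, as a consistency check."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


if __name__ == "__main__":
    # Toy reference sequence (hypothetical, for illustration only).
    reference = ["ATGGCCGCCGCTAAGAAGAAA"]
    usage = codon_usage(reference)
    optimized = back_translate("MAK", usage)
    print(optimized, translate(optimized))
```

In practice the reference set would be the coding sequences of several highly expressed chicken proteins, and a real optimizer would typically also weigh factors this sketch ignores, such as restriction sites and mRNA secondary structure.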

In one exemplary embodiment of a heterologous nucleic acid for use by the methods
of the present invention, a nucleic acid insert encoding the human interferon α2b
polypeptide optimized for codon-usage by the chicken is microinjected into the cytoplasm
of a stage 1 embryo. Optimization of the sequence for codon usage is useful in elevating the
level of translation in avian eggs.

It is contemplated to be within the scope of the present invention for any nucleic
acid encoding a polypeptide to be optimized for expression in avian cells. It is further
contemplated that the codon usage may be optimized for a particular avian species used as a
source of the host cells. In one embodiment of the present invention, the heterologous
polypeptide is encoded using the codon-usage of a chicken.

5.2.4 SPECIFIC VECTORS OF THE INVENTION 

In a preferred embodiment, a transgene of the invention comprises a chicken, or
other avian, lysozyme control region sequence which directs expression of the coding
sequence within the transgene. A series of PCR amplifications of template chicken
genomic DNA are used to isolate the gene expression control region of the chicken
lysozyme locus. Two amplification reactions used the PCR primer sets 5pLMAR2 (5'-
TGCCGCCTTCTTTGATATTC-3') (SEQ ID NO: 1) and LE-6.1kbrevl (5'-
TTGGTGGTAAGGCCnTTTTG-3') (SEQ ID NO: 2) (Set 1) and lys-6.1 (5'-
CTGGCAAGCTGTCAAAAACA-3') (SEQ ID NO: 3) and LysElRev (5'-
CAGCTCACATCGTCCAAAGA-3') (SEQ ID NO: 4) (Set 2). The amplified PCR
products were united as a contiguous isolated nucleic acid by a third PCR amplification step
with the primers SEQ ID NOS: 1 and 4.
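
As a quick sanity check on primers such as those listed above, basic properties can be computed with a short Python sketch. The sequences are copied as printed for SEQ ID NOS: 1, 3 and 4 (SEQ ID NO: 2 is omitted because one base is not legible in this copy). The melting-temperature estimate uses the common "Wallace" rule of thumb, 2 °C per A/T plus 4 °C per G/C, which is an assumption introduced here for illustration and is not a method stated in the text; the helper names are hypothetical.

```python
def gc_fraction(primer):
    """Fraction of bases in the primer that are G or C."""
    primer = primer.upper()
    return sum(base in "GC" for base in primer) / len(primer)


def wallace_tm(primer):
    """Rule-of-thumb melting temperature in deg C: 2*(A+T) + 4*(G+C)."""
    primer = primer.upper()
    at = sum(base in "AT" for base in primer)
    gc = sum(base in "GC" for base in primer)
    return 2 * at + 4 * gc


# Primer sequences as printed in the text, written 5' to 3'.
PRIMERS = {
    "5pLMAR2 (SEQ ID NO: 1)": "TGCCGCCTTCTTTGATATTC",
    "lys-6.1 (SEQ ID NO: 3)": "CTGGCAAGCTGTCAAAAACA",
    "LysElRev (SEQ ID NO: 4)": "CAGCTCACATCGTCCAAAGA",
}

if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(f"{name}: {len(seq)} nt, GC {gc_fraction(seq):.0%}, Tm about {wallace_tm(seq)} C")
```

The rule of thumb is only a rough guide for short oligonucleotides; actual annealing temperatures would normally be chosen with a nearest-neighbor model and confirmed empirically.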

The isolated PCR-amplified product, comprising about 12 kb of the nucleic acid
region 5' upstream of the native chicken lysozyme gene locus, was cloned into the plasmid
pCMV-LysSPIFNMM. pCMV-LysSPIFNMM comprises a modified nucleic acid insert
encoding a human interferon α2b sequence and an SV40 polyadenylation signal sequence
(SEQ ID NO: 8) 3' downstream of the interferon encoding nucleic acid. The sequence SEQ
ID NO: 5 of the nucleic acid insert encoding human interferon α2b was in accordance with
avian cell codon usage, as determined from the nucleotide sequences encoding chicken
ovalbumin, lysozyme, ovomucoid, ovotransferrin, conalbumin, and ovomucin.

The nucleic acid sequence (SEQ ID NO: 6) (GenBank Accession No. AF405538) of
the insert in pAVIJCR-A115.93.1.2 is shown in Figures 1A-E. The modified human
interferon α2b encoding nucleotide sequence SEQ ID NO: 5 (GenBank Accession No.
AF405539) and the novel chicken lysozyme gene expression control region SEQ ID NO: 7
(GenBank Accession No. AF405540) are also shown in the Figures. A
polyadenylation signal sequence that is suitable for operably linking to the polypeptide-
encoding nucleic acid insert is the SV40 signal sequence SEQ ID NO: 8, as shown in Figure
4.

The plasmid pAVIJCR-A115.93.1.2 was restriction digested with enzyme FseI to
isolate a 15.4 kb DNA containing the lysozyme 5' matrix attachment region (MAR) and the
-12.0 kb lysozyme promoter driving the expression of the interferon-encoding insert, as
described in Example 17, below. Plasmid pIIIilys was restriction digested with MluI and
XhoI to isolate an approximately 6 kb nucleic acid, comprising the 3' lysozyme domain, the
sequence of which (SEQ ID NO: 9) is shown in Figures 5A-C. The 15.4 kb and 6 kb
nucleic acids were ligated and the 21.4 kb nucleic acid comprising the nucleic acid sequence
SEQ ID NO: 10 as shown in Figures 6A-J was transformed into recipient STBL4 cells.

The inclusion of the novel isolated avian lysozyme gene expression control region of
the present invention upstream of a codon-optimized interferon-encoding sequence in
pAVIJCR-A115.93.1.2 allowed expression of the interferon polypeptide in avian cells
transfected by sperm-mediated transfer. The 3' lysozyme domain SEQ ID NO: 9, when
operably linked downstream of a heterologous nucleic acid insert, also allows expression of
the nucleic acid insert as described in Example 18, below. For example, the nucleic acid
insert may encode a heterologous polypeptide such as the α2b interferon encoded by the
sequence SEQ ID NO: 5.

It is further contemplated that any nucleic acid sequence encoding a polypeptide may
be operably linked to the novel isolated avian lysozyme gene expression control region
(SEQ ID NO: 7) and optionally operably linked to the 3' lysozyme domain SEQ ID NO: 9 so
as to be expressed in a transfected avian cell. The plasmid construct pAVIJCR-A115.93.1.2
can be introduced into cultured quail oviduct cells by transfection. ELISA assays of the
cultured media showed that the transfected cells synthesized a polypeptide detectable with
anti-human interferon α2b antibodies.

The isolated chicken lysozyme gene expression control region (SEQ ID NO: 7) for
use in the methods of the present invention comprises the nucleotide elements that are
positioned 5' upstream of the lysozyme-encoding region of the native chicken lysozyme
locus and which are necessary for the regulated expression of a downstream polypeptide-
encoding nucleic acid. While not wishing to be bound by any one theory, the inclusion of at
least one 5' MAR sequence or reference element in the isolated control region may
confer positional independence to a transfected gene operably linked to the novel lysozyme
gene expression control region.

The isolated lysozyme gene expression control region (SEQ ID NO: 7) of the present
invention is useful for reducing the chromosomal positional effect of a transgene operably
linked to the lysozyme gene expression control region and transfected into a recipient avian
cell. By isolating a region of the avian genome extending from a point 5' upstream of a 5'
MAR of the lysozyme locus to the junction between the signal peptide sequence and a
polypeptide-encoding region, cis-regulatory elements are also included that may allow gene
expression in a tissue-specific manner. The lysozyme promoter region of the present
invention, therefore, will allow expression of an operably linked heterologous nucleic acid
insert in a transfected avian cell such as, for example, an oviduct cell.

It is further contemplated that a recombinant DNA of the present invention may
further comprise the chicken lysozyme 3' domain (SEQ ID NO: 9) linked downstream of
the nucleic acid insert encoding a heterologous polypeptide. The lysozyme 3' domain (SEQ
ID NO: 9) includes a nucleic acid sequence encoding a 3' MAR domain that may cooperate
with a 5' MAR to direct the insertion of the construct of the present invention into the
chromosome of a transgenic avian, or may act independently of the 5' MAR.

Fragments of a nucleic acid encoding a portion of the subject lysozyme gene
expression control region may also be useful as an autonomous gene regulatory element that
may itself be operably linked to a polypeptide-encoding nucleic acid. Alternatively, the
fragment may be combined with fragments derived from other gene promoters, such as an
avian ovalbumin, ovomucoid, ovotransferrin, conalbumin or ovomucin promoter, thereby
generating novel promoters having new properties or a combination of properties. As used
herein, a fragment of the nucleic acid encoding an active portion of a lysozyme gene
expression control region refers to a nucleotide sequence having fewer nucleotides than the
nucleotide sequence encoding the entire nucleic acid sequence of the lysozyme gene
expression control region, but at least 200 nucleotides.

The present invention also contemplates the use of antisense nucleic acid molecules
that are designed to be complementary to a coding strand of a nucleic acid (i.e.,
complementary to an endogenous DNA or an mRNA sequence) or, alternatively,
complementary to a 5' or 3' untranslated region of the mRNA and therefore useful for
regulating the expression of a gene by the lysozyme promoter.
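
The complementarity that defines an antisense molecule can be shown with a minimal Python sketch, assuming plain Watson-Crick base pairing and an unmodified DNA oligonucleotide. The function name antisense_dna and the six-base target are hypothetical and serve only as an illustration; a practical antisense design would also consider length, target accessibility and chemical modification.

```python
# Watson-Crick complements; U in an RNA target pairs with A, like T.
_COMPLEMENT = str.maketrans("ACGTU", "TGCAA")


def antisense_dna(target):
    """Return the DNA oligonucleotide (5'->3') complementary to `target` (5'->3').

    `target` may be written as mRNA (with U) or as the coding DNA strand (with T).
    """
    target = target.upper()
    if set(target) - set("ACGTU"):
        raise ValueError("target must contain only A, C, G, T or U")
    # Complement each base, then reverse so the result also reads 5'->3'.
    return target.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    print(antisense_dna("AUGGCC"))
```

Because the result is the reverse complement, applying the function twice to a DNA sequence returns the original sequence, which is a convenient check on any antisense design.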

Synthesized oligonucleotides can be produced in variable lengths when, for example,
non-naturally occurring polypeptide sequences are desired. The number of bases
synthesized will depend upon a variety of factors, including the desired use for the probes or
primers. Additionally, sense or anti-sense nucleic acids or oligonucleotides can be
chemically synthesized using modified nucleotides to increase the biological stability of the
molecule or of the binding complex formed between the anti-sense and sense nucleic acids.
For example, acridine substituted nucleotides can be synthesized. Protocols for designing
isolated nucleotides, nucleotide probes, and/or nucleotide primers are well-known to those
of ordinary skill, and such nucleotides can be purchased commercially from a variety of
sources (e.g., SIGMA GENOSYS®, The Woodlands, TX or The Great American Gene Co.,
Ramona, CA).

5.2.5 RECOMBINANT EXPRESSION VECTORS

A useful application of the novel promoters of the present invention, such as the
avian lysozyme gene expression control region (SEQ ID NO: 7) or the MDOT promoter
construct (SEQ ID NO: 11), is the possibility of increasing the amount of a heterologous
protein present in a bird, especially a chicken, by gene transfer. In most instances, a
heterologous polypeptide-encoding nucleic acid insert transferred into the recipient animal
host will be operably linked with a gene expression control region to allow the cell to
initiate and continue production of the genetic product protein. A recombinant DNA
molecule of the present invention can be transferred into the extra-chromosomal or genomic
DNA of the host.

Expression of a foreign gene in an avian cell permits partial or complete post-
translational modification such as, but not limited to, glycosylation and/or the formation of
the relevant inter- or intra-chain disulfide bonds. Examples of vectors useful for expression in
the chicken Gallus gallus include pYepSec1 (Baldari et al., 1987, E.M.B.O.J., 6: 229-234;
incorporated herein by reference in its entirety) and pYES2 (INVITROGEN® Corp., San
Diego, CA).

The present invention contemplates that the injected cell may transiently contain the
injected DNA, whereby the recombinant DNA or expression vector may not be integrated
into the genomic nucleic acid. It is further contemplated that the injected recombinant DNA
or expression vector may be stably integrated into the genomic DNA of the recipient cell,
thereby replicating with the cell so that each daughter cell receives a copy of the injected
nucleic acid. It is still further contemplated for the scope of the present invention to include
a transgenic animal producing a heterologous protein expressed from an injected nucleic
acid according to the present invention.

Heterologous nucleic acid molecules can be delivered to oocytes using the sperm-
mediated transfection methods of the present invention. The nucleic acid molecule may be
inserted into a cell to which the nucleic acid molecule (or promoter coding region) is
heterologous (i.e., not normally present). Alternatively, the recombinant DNA molecule
may be introduced into cells which normally contain the recombinant DNA molecule or the
particular coding region, as, for example, to correct a deficiency in the expression of a
polypeptide, or where over-expression of the polypeptide is desired.

Another aspect of the present invention, therefore, is a method of expressing a
heterologous polypeptide in an avian cell by transfecting the avian cell with a selected
heterologous nucleic acid comprising an avian promoter operably linked to a nucleic acid
insert encoding a polypeptide and, optionally, a polyadenylation signal sequence. The
transfected cell, which may be an avian embryonic cell microinjected with a heterologous
nucleic acid, will generate a transgenic embryo that after introduction into a recipient hen
will be laid as a hard-shell egg and develop into a transgenic chick.

In another embodiment of the present invention, the nucleic acid insert comprises
the chicken lysozyme gene expression control region, a nucleic acid insert encoding a
human interferon α2b and codon optimized for expression in an avian cell, and a chicken 3'
domain, i.e., downstream enhancer elements.

In one embodiment of the present invention, the transgenic animal is an avian
selected from a turkey, duck, goose, quail, [...] bird. In another embodiment, the
heterologous polypeptide
produced under the transcriptional control of the avian promoter is produced in the white of
an egg. In yet another embodiment of the present invention, the heterologous polypeptide is
produced in the serum of a bird.

5.3 HETEROLOGOUS PROTEINS PRODUCED BY TRANSGENIC AVIANS

Methods of the present invention, providing for the production of heterologous
protein in the avian oviduct (or other tissue leading to deposition of the protein into the egg)
and the production of eggs containing heterologous protein, involve providing a suitable
vector coding for the heterologous protein and introducing the vector into oocytes by sperm-
mediated transfection such that the vector is integrated into the genome of the resulting
transgenic embryo. A subsequent step involves deriving a mature transgenic avian from the
transgenic embryo produced in the previous steps by transferring the injected cell or cells
into the infundibulum of a recipient hen; producing a hard shell egg from that hen; and
allowing the egg to develop and hatch to produce a transgenic bird.

A transgenic avian so produced from transgenic embryonic cells is known as a
founder. Such founders may be mosaic for the transgene (in certain embodiments, the
founder has 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 90%, 100% of the cells containing
the transgene). The invention further provides production of heterologous proteins in other
tissues of the transgenic avians. Some founders will carry the transgene in the tubular gland
cells in the magnum of their oviducts. These birds will express the exogenous protein
encoded by the transgene in their oviducts. If the exogenous protein contains the
appropriate signal sequences, it will be secreted into the lumen of the oviduct and into the
white of an egg.

Some founders are germ-line founders. A germ-line founder is a founder that carries
the transgene in genetic material of its germ-line tissue, and may also carry the transgene in
oviduct magnum tubular gland cells that express the exogenous protein. Therefore, in
accordance with the invention, the transgenic bird may have tubular gland cells expressing
the exogenous protein and the offspring of the transgenic bird will also have oviduct
magnum tubular gland cells that express the exogenous protein. Alternatively, the offspring
express a phenotype determined by expression of the exogenous gene in a specific tissue of
the avian. In preferred embodiments, the heterologous proteins are produced from
transgenic avians that were not (or whose founder ancestors were not) produced using a
eukaryotic viral vector, or a retroviral vector.

The present invention can be used to express, in large yields and at low cost, a wide
range of desired proteins including those used as human and animal pharmaceuticals,
diagnostics, and livestock feed additives. Proteins such as growth hormones, cytokines,
structural proteins and enzymes, including human growth hormone, interferon, lysozyme,
and β-casein, are examples of proteins that are desirably expressed in the oviduct and
deposited in eggs according to the invention. Other possible proteins to be produced
include, but are not limited to, albumin, α-1 antitrypsin, antithrombin III, collagen, factors
VIII, IX, X (and the like), fibrinogen, hyaluronic acid, insulin, lactoferrin, protein C,
erythropoietin (EPO), granulocyte colony-stimulating factor (G-CSF), granulocyte
macrophage colony-stimulating factor (GM-CSF), tissue-type plasminogen activator (tPA),
feed additive enzymes, somatotropin, and chymotrypsin. Immunoglobulins and genetically
engineered antibodies, including immunotoxins that bind to surface antigens on human
tumor cells and destroy them, can also be expressed for use as pharmaceuticals or
diagnostics. It is contemplated that immunoglobulin polypeptides expressed in avian cells
following transfection by the methods of the present invention may include monomeric
heavy and light chains, single-chain antibodies or multimeric immunoglobulins comprising
variable heavy and light chain regions, i.e., antigen-binding domains, or intact heavy and
light immunoglobulin chains.

5.3.1 MULTIMERIC PROTEINS

The invention, in preferred embodiments, provides methods for producing 
multimeric proteins, preferably immunoglobulins, such as antibodies, and antigen binding 
fragments thereof. 

In one embodiment of the present invention, the multimeric protein is an
immunoglobulin, wherein the first and second heterologous polypeptides are
immunoglobulin heavy and light chains respectively. Illustrative examples of this and other
aspects and embodiments of the present invention for the production of heterologous
multimeric polypeptides in avian cells are fully disclosed in U.S. Patent Application No.
09/877,374, filed June 8, 2001, by Rapp, which is incorporated herein by reference in its
entirety. In one embodiment of the present invention, therefore, the multimeric protein is an
immunoglobulin wherein the first and second heterologous polypeptides are an
immunoglobulin heavy and light chain respectively. Accordingly, the invention provides
immunoglobulin and other multimeric proteins that have been produced by transgenic
avians of the invention.

In the various embodiments of this aspect of the present invention, an
immunoglobulin polypeptide encoded by the transcriptional unit of at least one expression
vector may be an immunoglobulin heavy chain polypeptide comprising a variable region or
a variant thereof, and may further comprise a D region, a J region, a C region, or a
combination thereof. An immunoglobulin polypeptide encoded by the transcriptional unit
of an expression vector may also be an immunoglobulin light chain polypeptide comprising
a variable region or a variant thereof, and may further comprise a J region and a C region. It
is also contemplated to be within the scope of the present invention for the immunoglobulin
regions to be derived from the same animal species, or a mixture of species including, but
not limited to, human, mouse, rat, rabbit and chicken. In preferred embodiments, the
antibodies are human or humanized.

In other embodiments of the present invention, the immunoglobulin polypeptide
encoded by the transcriptional unit of at least one expression vector comprises an
immunoglobulin heavy chain variable region, an immunoglobulin light chain variable
region, and a linker peptide, thereby forming a single-chain antibody capable of selectively
binding an antigen.

Another aspect of the present invention provides a method for the production in an
avian of a heterologous protein capable of forming an antibody suitable for selectively
binding an antigen, comprising the step of producing a transgenic avian incorporating at
least one transgene, wherein the transgene encodes at least one heterologous polypeptide
selected from an immunoglobulin heavy chain variable region, an immunoglobulin heavy
chain comprising a variable region and a constant region, an immunoglobulin light chain
variable region, an immunoglobulin light chain comprising a variable region and a constant
region, and a single-chain antibody comprising two peptide-linked immunoglobulin variable
regions. Preferably, the antibody is expressed such that it is deposited in the white of the
developing eggs of the avian. The hard shell avian eggs thus produced can be harvested and
the heterologous polypeptide capable of forming, or which has formed, an antibody can be
isolated from the harvested egg. It is also understood that the heterologous polypeptides
may be expressed under the transcriptional control of promoters that allow for release
of the polypeptides into the serum of the transgenic animal. Exemplary promoters for
non-tissue-specific production of a heterologous protein are the CMV promoter and the RSV
promoter.

In one embodiment of this method of the present invention, the transgene comprises
a transcription unit encoding a first and a second immunoglobulin polypeptide operatively
linked to a transcription promoter, a transcription terminator and, optionally, an internal
ribosome entry site (IRES) (see, for example, U.S. Patent No. 4,937,190 to Palmenberg et
al., the contents of which are incorporated herein by reference in their entirety).

In an embodiment of this method of the present invention, the isolated heterologous
protein is an antibody capable of selectively binding to an antigen. In this embodiment, the
antibody may be generated within the serum of an avian or within the white of the avian egg
by combining at least one immunoglobulin heavy chain variable region and at least one
immunoglobulin light chain variable region, preferably cross-linked by at least one
disulfide bridge. The combination of the two variable regions will generate a binding site
capable of binding an antigen using methods for antibody reconstitution that are well known
in the art.

It is, however, contemplated to be within the scope of the present invention for
immunoglobulin heavy and light chains, or variants or derivatives thereof, to be expressed
in separate transgenic avians, and therefore isolated from separate media including serum or
eggs, each isolate comprising a single species of immunoglobulin polypeptide. The method
may further comprise the step of combining a plurality of isolated heterologous
immunoglobulin polypeptides, thereby producing an antibody capable of selectively binding
to an antigen. In this embodiment, two individual transgenic avians may be generated
wherein one transgenic avian produces serum or eggs having an immunoglobulin heavy chain
variable region, or a polypeptide comprising such, expressed therein. A second transgenic
animal, having a second transgene, produces serum or eggs having an immunoglobulin light
chain variable region, or a polypeptide comprising such, expressed therein. The
polypeptides may be isolated from their respective sera and eggs and combined in vitro to
generate a binding site capable of binding an antigen.

Examples of therapeutic antibodies that can be used in methods of the invention
include but are not limited to HERCEPTIN® (Trastuzumab) (Genentech, CA) which is a
humanized anti-HER2 monoclonal antibody for the treatment of patients with metastatic
breast cancer; REOPRO® (abciximab) (Centocor) which is an antibody to the glycoprotein IIb/IIIa
receptor on the platelets for the prevention of clot formation; ZENAPAX® (daclizumab)
(Roche Pharmaceuticals, Switzerland) which is an immunosuppressive, humanized
anti-CD25 monoclonal antibody for the prevention of acute renal allograft rejection;
PANOREX™ which is a murine anti-17-IA cell surface antigen IgG2a antibody (Glaxo
Wellcome/Centocor); BEC2 which is a murine anti-idiotype (GD3 epitope) IgG antibody
(ImClone System); IMC-C225 which is a chimeric anti-EGFR IgG antibody (ImClone
System); VITAXIN™ which is a humanized anti-αVβ3 integrin antibody (Applied
Molecular Evolution/MedImmune); Campath 1H/LDP-03 which is a humanized anti-CD52
IgG1 antibody (Leukosite); Smart M195 which is a humanized anti-CD33 IgG antibody
(Protein Design Lab/Kanebo); RITUXAN™ which is a chimeric anti-CD20 IgG1 antibody
(IDEC Pharm/Genentech, Roche/Zettyaku); LYMPHOCIDE™ which is a humanized
anti-CD22 IgG antibody (Immunomedics); ICM3 is a humanized anti-ICAM3 antibody (ICOS
Pharm); IDEC-114 is a primatized anti-CD80 antibody (IDEC Pharm[...]);
ZEVALIN™ is a radiolabelled murine anti-CD20 antibody (IDEC/Schering AG);
IDEC-131 is a humanized anti-CD40L antibody [...]
antibody (IDEC); IDEC-152 is a primatized anti-CD23 antibody (IDEC/Seikagaku);
SMART anti-CD3 is a humanized anti-CD3 IgG (Protein Design Lab); 5G1.1 is a
humanized anti-complement factor 5 (C5) antibody (Alexion Pharm); D2E7 is a humanized
anti-TNF-α antibody (CAT/BASF); CDP870 is a humanized anti-TNF-α Fab fragment
(Celltech); IDEC-151 is a primatized [...] (IDEC Pharm/SmithKline
Beecham); MDX-CD4 is a human anti-CD4 IgG antibody (Medarex/Eisai/Genmab);
CDP571 is a humanized anti-TNF-α IgG4 antibody (Celltech); LDP-02 is a humanized
anti-α4β7 antibody (LeukoSite/Genentech); OrthoClone OKT4A is a humanized anti-CD4 IgG
antibody (Ortho Biotech); ANTOVA™ is a humanized anti-CD40L IgG antibody (Biogen);
ANTEGREN™ is a humanized anti-VLA-4 IgG antibody (Elan); and CAT-152 is a human
anti-TGF-β2 antibody (Cambridge Ab Tech).

5.3.2 PROTEIN RECOVERY

The protein of the present invention may be produced in purified form by any known
conventional technique. For example, chicken cells may be homogenized and centrifuged.
The supernatant can then be subjected to sequential ammonium sulfate precipitation and
heat treatment. The fraction containing the protein of the present invention is subjected to
gel filtration in an appropriately sized dextran or polyacrylamide column to separate the
proteins. If necessary, the protein fraction may be further purified by HPLC. In another
embodiment, an affinity column is used, wherein the protein is expressed with a tag.

Accordingly, the invention provides proteins that are produced by transgenic avians 
of the invention. In a preferred embodiment, the protein is produced and isolated from an 
avian egg. In another embodiment, the protein is produced and isolated from avian serum. 

5.4 PHARMACEUTICAL COMPOSITIONS 

The present invention further provides pharmaceutical compositions, formulations,
dosage units and methods of administration comprising the heterologous proteins produced
by the transgenic avians using methods of the invention. Preferably, compositions of the
invention comprise a prophylactically or therapeutically effective amount of the
heterologous protein, and a pharmaceutically acceptable carrier.

The term "carrier" refers to a diluent, adjuvant, excipient, or vehicle with which a
compound of the invention is administered. Such pharmaceutical vehicles can be liquids,
such as water and oils, including those of petroleum, animal, vegetable or synthetic origin,
such as peanut oil, soybean oil, mineral oil, sesame oil and the like. The pharmaceutical
vehicles can be saline, gum acacia, gelatin, starch paste, talc, keratin, colloidal silica, urea,
and the like. In addition, auxiliary, stabilizing, thickening, lubricating and coloring agents
may be used. When administered to a patient, the compounds of the invention and
pharmaceutically acceptable vehicles are preferably sterile. Water is a preferred vehicle
when the compound of the invention is administered intravenously. Saline solutions and
aqueous dextrose and glycerol solutions can also be employed as liquid vehicles,
particularly for injectable solutions. Suitable pharmaceutical vehicles also include
excipients such as starch, glucose, lactose, sucrose, gelatin, malt, rice, flour, chalk, silica
gel, sodium stearate, glycerol monostearate, talc, sodium chloride, dried skim milk,
glycerol, propylene glycol, water, ethanol and the like. The present compositions, if desired,
can also contain minor amounts of wetting or emulsifying agents, or pH buffering agents.

The present compositions can take the form of solutions, suspensions, emulsions,
tablets, pills, pellets, capsules, capsules containing liquids, powders, sustained-release
formulations, suppositories, aerosols, sprays, or any other form
suitable for use. In one embodiment, the pharmaceutically acceptable vehicle is a capsule
(see, e.g., U.S. Patent No. 5,698,155). Other examples of suitable pharmaceutical vehicles
are described in "Remington: The Science and Practice of Pharmacy", 20th ed., Mack
Publishing Co., 2000.

In a preferred embodiment, the heterologous proteins are formulated in accordance
with routine procedures as a pharmaceutical composition adapted for intravenous
administration to human beings. Typically, compounds of the invention for intravenous
administration are solutions in sterile isotonic aqueous buffer. Where necessary, the
compositions may also include a solubilizing agent. Compositions for intravenous
administration may optionally include a local anesthetic such as lignocaine to ease pain at
the site of the injection. Generally, the ingredients are supplied either separately or mixed
together in unit dosage form, for example, as a dry lyophilized powder or water-free
concentrate in a hermetically sealed container such as an ampoule or sachet indicating the
quantity of active agent. Where the heterologous protein of the invention is to be
administered by infusion, it can be dispensed, for example, with an infusion bottle
containing sterile pharmaceutical grade water or saline. Where the composition of the
invention is administered by injection, an ampoule of sterile water for injection or saline can
be provided so that the ingredients may be mixed prior to administration.

Compositions for oral delivery may be in the form of [...]
oily suspensions, granules, powders, emulsions, capsules, syrups, or elixirs, for example.
Orally administered compositions may contain one or more optional agents, for example,
sweetening agents such as fructose, aspartame or saccharin; flavoring agents such as
peppermint, oil of wintergreen, or cherry; coloring agents; and preserving agents, to provide
a pharmaceutically palatable preparation. Moreover, where in tablet or pill form, the
compositions may be coated to delay disintegration and absorption in the gastrointestinal
tract, thereby providing a sustained action over an extended period of time. Selectively
permeable membranes surrounding an osmotically active driving compound are also
suitable for orally administered compounds of the invention. In these latter platforms, fluid
from the environment surrounding the capsule is imbibed by the driving compound, which
swells to displace the agent or agent composition through an aperture. These delivery
platforms can provide an essentially zero order delivery profile as opposed to the spiked
profiles of immediate release formulations. A time delay material such as glycerol
monostearate or glycerol stearate may also be used. Oral compositions can include standard
vehicles such as mannitol, lactose, starch, magnesium stearate, sodium saccharin, cellulose,
magnesium carbonate, etc. Such vehicles are preferably of pharmaceutical grade.

Further, the effect of the heterologous proteins may be delayed or prolonged by
proper formulation. For example, a slowly soluble pellet of the compound may be prepared
and incorporated in a tablet or capsule. The technique may be improved by making pellets
of several different dissolution rates and filling capsules with a mixture of the pellets.
Tablets or capsules may be coated with a film which resists dissolution for a predictable
period of time. Even the parenteral preparations may be made long-acting, by dissolving or
suspending the compound in oily or emulsified vehicles which allow it to disperse only
slowly in the serum.

5.5 TRANSGENIC AVIANS 

Another aspect of the present invention concerns transgenic avians, preferably
chicken or quail, produced by methods of the invention described in section 5.1 infra,
preferably by introducing a nucleic acid comprising a transgene into an avian oocyte by the
sperm-mediated transfection methods of the present invention. In one embodiment, a
heterologous nucleic acid is introduced into an avian oocyte by sperm-mediated transfection,
resulting in a transgenic embryo which is then allowed to develop, preferably after being
transferred into the reproductive tract of a recipient hen where it is encapsulated by natural
egg white proteins and a natural egg shell; the egg is then incubated and hatched to produce
a transgenic chick. The heterologous polypeptide or polypeptides encoded by the transgenic
heterologous nucleic acid may be secreted into the oviduct lumen of the mature transgenic
chicken and deposited as a constituent component of egg white. The resulting transgenic
avian chick (i.e., the G0) will carry one or more desired transgene(s) in some or all of its cells,
preferably in its germ line. These G0 transgenic avians can be bred using methods well
known in the art to generate second generation (i.e., G1s) transgenic avians that carry the
transgene, i.e., achieve germline transmission of the transgene. In preferred embodiments,
the methods of the invention result in germline transmission, i.e., percentage of G0s that
transmit the transgene to progeny (G1s), that is greater than 5%, preferably, greater than
10%, 20%, 30%, 40%, and, most preferably, greater than 50%, 60%, 70%, 80%, 90% or
even 100%. In other embodiments, the efficiency of transgenesis (i.e., proportion of G0s
containing the transgene) is greater than 2%, 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%,
80% or 99%.
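
By way of illustration only, the two percentages recited above can be computed as in the following Python sketch. The function names are hypothetical, and the choice of denominator for the efficiency of transgenesis (the number of birds screened) is an assumption, since the text does not state it.

```python
def _percentage(part: int, whole: int) -> float:
    """Return part/whole as a percentage after validating the counts."""
    if whole <= 0:
        raise ValueError("whole must be a positive count")
    if not 0 <= part <= whole:
        raise ValueError("part must lie between 0 and whole")
    return 100.0 * part / whole


def germline_transmission_rate(g0_transmitting: int, g0_bred: int) -> float:
    """Percentage of bred G0 birds that passed the transgene to G1 progeny."""
    return _percentage(g0_transmitting, g0_bred)


def transgenesis_efficiency(g0_transgenic: int, birds_screened: int) -> float:
    """Percentage of screened birds found to carry the transgene.

    Assumption: the denominator is the number of birds screened.
    """
    return _percentage(g0_transgenic, birds_screened)


def exceeds(rate_percent: float, threshold_percent: float) -> bool:
    """True when a rate is strictly greater than a recited threshold."""
    return rate_percent > threshold_percent
```

For example, if 3 of 20 bred G0 birds transmit the transgene, `germline_transmission_rate(3, 20)` gives 15.0, which satisfies the "greater than 5%" and "greater than 10%" thresholds but not "greater than 20%".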

The egg can be harvested after laying and before hatching of a chick, or further
incubated to generate a cloned chick, optionally genetically modified. The cloned chick
may carry a transgene in all or most of its cells. After maturation, the transgenic avian may
lay eggs that contain one or more desired heterologous protein(s).

The cloned chick may also be a knock-in chick expressing an alternative phenotype
or capable of laying eggs having a heterologous protein therein. The reconstructed egg
may also be cultured to term using the ex ovo method described by Perry et al. (supra).

Following maturation, the transgenic avian and/or transgenic progeny thereof may
lay eggs containing one or more desired heterologous protein(s) expressed therein that
can be easily harvested therefrom. The G1 chicks, when sexually mature, can then be bred
to produce progeny that are homozygous or heterozygous for the transgene.

A transgenic avian of the invention may contain at least one transgene, at least two
transgenes, at least 3 transgenes, at least 4 transgenes, or at least 5 transgenes, and preferably,
though optionally, may express the subject nucleic acid encoding a polypeptide in one or
more cells in the animal, such as the oviduct cells of the chicken. In embodiments of the
present invention, the expression of the transgene may be restricted to specific subsets of
cells, tissues, or developmental stages utilizing, for example, cis-acting sequences that
control expression in the desired pattern. Toward this end, it is contemplated that
tissue-specific regulatory sequences, or tissue-specific promoters, and conditional regulatory
sequences may be used to control expression of the transgene in certain spatial patterns.
Moreover, temporal patterns of expression can be provided by, for example, conditional
recombination systems or prokaryotic transcriptional regulatory sequences. The inclusion
of a 5' MAR region, and optionally the 3' MAR on either end of the sequence, in the
expression cassettes suitable for use in the methods of the present invention may allow the
heterologous expression unit to escape the chromosomal positional effect (CPE) and
therefore be expressed at a more uniform level in transgenic tissues that received the
transgene by a route other than through germ line cells.

The transgenes may, in certain embodiments, be expressed conditionally; for example, the
heterologous protein coding sequence may be under the control of an inducible promoter, such as
a prokaryotic promoter or operator that requires a prokaryotic inducer protein to be
activated. Operators present in prokaryotic cells have been extensively characterized in vivo
and in vitro and can be readily manipulated to place them in any position upstream from or
within a gene by standard techniques. Such operators comprise promoter regions and
regions that specifically bind proteins such as activators and repressors. One example is the
operator region of the lexA gene of E. coli to which the LexA polypeptide binds. Other
exemplary prokaryotic regulatory sequences and the corresponding trans-activating
prokaryotic proteins are disclosed by Brent and Ptashne in U.S. Patent No. 4,833,080 (the
contents of which are herein incorporated by reference in their entirety). Transgenic animals
can be created which harbor the subject transgene under transcriptional control of a
prokaryotic sequence or other activator sequence that is not appreciably activated by avian
proteins. Breeding of this transgenic animal with another animal that is transgenic for the
corresponding trans-activator can be used to activate the expression of the transgene.
Moreover, expression of the conditional transgenes can also be induced by gene therapy-like
methods wherein a gene encoding the trans-activating protein, e.g., a recombinase or a
prokaryotic protein, is delivered to the tissue and caused to be expressed, such as in a
cell-type-specific manner.

Transactors in these inducible or repressible transcriptional regulation systems
are designed to interact specifically with sequences engineered into the transgene. Such
systems include those regulated by tetracycline ("tet systems"), interferon, estrogen,
ecdysone, Lac operator, progesterone antagonist RU486, and rapamycin (FK506) with tet
systems being particularly preferred (see, e.g., Gingrich and Roder, 1998, Annu. Rev.
Neurosci. 21: 377-405; incorporated herein by reference in its entirety). These drugs or
hormones (or their analogs) act on modular transactors composed of natural or mutant
ligand-binding domains and intrinsic or extrinsic DNA binding and transcriptional
activation domains. In certain embodiments, expression of the heterologous peptide can be
regulated by varying the concentration of the drug or hormone in medium in vitro or in the
diet of the transgenic animal in vivo.

In a preferred embodiment, the control elements of the tetracycline-resistance operon
of E. coli are used as an inducible or repressible transactivator or transcriptional regulation
system ("tet system") for conditional expression of the transgene. A tetracycline-controlled
transactivator can require either the presence or absence of the antibiotic tetracycline, or one
of its derivatives, e.g., doxycycline (dox), for binding to the tet operator of the tet system,
and thus for the activation of the tet system promoter (Ptet).
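
The dependence of Ptet activity on the presence or absence of the antibiotic can be summarized, purely as an illustrative sketch, by the following Python truth table. The enum and function names are hypothetical; the labels tTA (binds the tet operator in the absence of tetracycline or dox) and rtTA (the reverse transactivator, which binds in its presence) follow common usage, and the model ignores dose dependence and basal activity.

```python
from enum import Enum


class Transactivator(Enum):
    """Two classes of tetracycline-controlled transactivator (illustrative labels)."""
    TTA = "tTA"    # binds the tet operator only when tetracycline/dox is absent
    RTTA = "rtTA"  # reverse type: binds the tet operator only when dox is present


def ptet_active(transactivator: Transactivator, dox_present: bool) -> bool:
    """Return True when the transactivator can bind tetO and activate Ptet.

    Simplified on/off model: no dose response, no leakiness.
    """
    if transactivator is Transactivator.TTA:
        return not dox_present
    return dox_present
```

Under this simplified model, adding doxycycline to the diet switches a tTA-driven transgene off and an rtTA-driven transgene on.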

In a specific embodiment, a tetracycline-repressed regulatable system (TrRS) is used
(Agha-Mohammadi and Lotze, 2000, J. Clin. Invest. 105(9): 1177-83; Shockett et al., 1995,
Proc. Natl. Acad. Sci. USA 92: 6522-26; and Gossen and Bujard, 1992, Proc. Natl. Acad.
Sci. USA 89: 5547-51; incorporated herein by reference in their entireties).

In another embodiment, a reverse tetracycline-controlled transactivator, e.g., rtTA2
S-M2, is used. The rtTA2 S-M2 transactivator has reduced basal activity in the absence of
doxycycline, increased stability in eukaryotic cells, and increased doxycycline sensitivity
(Urlinger et al., 2000, Proc. Natl. Acad. Sci. USA 97(14): 7963-68; incorporated herein by
reference in its entirety). In another embodiment, the tet-repressible system described by
Wells et al. (1999, Transgenic Res. 8(5): 371-81; incorporated herein by reference in its
entirety) is used. In one aspect of the embodiment, a single plasmid Tet-repressible system
is used. In another embodiment, the GAL4-UAS system (Ornitz et al., 1991, Proc. Natl.
Acad. Sci. USA 88:698-702; Rowitch et al., 1999, J. Neuroscience 19(20):8954-8965;
Wang et al., 1999, Proc. Natl. Acad. Sci. USA 96:8483-8488; Lewandoski, 2001, Nature
Reviews Genetics 2:743-755) or a GAL4-VP16 fusion protein system (Wang et al., 1999,
Proc. Natl. Acad. Sci. USA 96:8483-8488) is used.

In other embodiments, conditional expression of a transgene is regulated by using a
recombinase system that is used to turn on or off the gene's expression by recombination in
the appropriate region of the genome in which the potential drug target gene is inserted.
The transgene is flanked by recombinase sites, e.g., FRT sites. Such a recombinase system
can be used to turn on or off expression of a transgene (for review of temporal genetic
switches and "tissue scissors" using recombinases, see Hennighausen & Furth, 1999, Nature
Biotechnol. 17: 1062-63). Exclusive recombination in a selected cell type may be mediated
by use of a site-specific recombinase such as Cre, FLP-wild type (wt), FLP-L or FLPe.
Recombination may be effected by any art-known method, e.g., the method of Doetschman
et al. (1987, Nature 330: 576-78; incorporated herein by reference in its entirety); the
method of Thomas et al. (1986, Cell 44: 419-28; incorporated herein by reference in its
entirety); the Cre-loxP recombination system (Sternberg and Hamilton, 1981, J. Mol. Biol.
150: 467-86; Lakso et al., 1992, Proc. Natl. Acad. Sci. USA 89: 6232-36; which are both
incorporated herein by reference in their entireties); the FLP recombinase system of
Saccharomyces cerevisiae (O'Gorman et al., 1991, Science 251: 1351-55); the
Cre-loxP-tetracycline control switch (Gossen and Bujard, 1992, Proc. Natl. Acad. Sci. USA 89:
5547-51, incorporated herein by reference in its entirety); and the ligand-regulated recombinase
system (Kellendonk et al., 1999, J. Mol. Biol. 285: 175-82; incorporated herein by reference
in its entirety). Preferably, the recombinase is highly active, e.g., the Cre-loxP or the FLPe
system, and has enhanced thermostability (Rodriguez et al., 2000, Nature Genetics 25:
139-40; incorporated herein by reference in its entirety).

In a specific embodiment, the ligand-regulated recombinase system of Kellendonk et
al. (1999, J. Mol. Biol. 285: 175-82; incorporated herein by reference in its entirety) can be
used. In this system, the ligand-binding domain (LBD) of a receptor, e.g., the progesterone
or estrogen receptor, is fused to the Cre recombinase to increase specificity of the
recombinase.
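
How recombination between two flanking recombinase sites can turn expression off or on may be pictured with the following minimal Python sketch. It treats a construct as an ordered list of named elements and removes whatever lies between the first two copies of a site, leaving a single copy behind. The function name, the element labels, and the "stop" cassette used in the switch-on example are illustrative assumptions and are not taken from the cited references; site orientation, which in practice determines whether excision or inversion occurs, is not modeled.

```python
from typing import List


def excise_between_sites(construct: List[str], site: str = "FRT") -> List[str]:
    """Model recombinase-mediated excision between two recombinase sites.

    Everything between the first two occurrences of `site` is removed and
    one copy of the site remains. If fewer than two sites are present, the
    construct is returned unchanged (as a copy).
    """
    positions = [i for i, element in enumerate(construct) if element == site]
    if len(positions) < 2:
        return list(construct)
    first, second = positions[0], positions[1]
    # Keep up to and including the first site, then resume after the second.
    return construct[: first + 1] + construct[second + 1 :]


# Switch OFF: the flanked transgene itself is excised.
off_example = excise_between_sites(
    ["promoter", "FRT", "transgene", "FRT", "terminator"]
)

# Switch ON (hypothetical layout): a flanked stop cassette is excised,
# bringing the transgene under the control of the promoter.
on_example = excise_between_sites(
    ["promoter", "FRT", "stop", "FRT", "transgene"]
)
```

In the first layout the transgene is lost after recombination; in the second, removal of the intervening cassette places the transgene directly downstream of the promoter.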

In the case of an avian, a heterologous polypeptide or polypeptides encoded by the
transgenic nucleic acid may be secreted into the oviduct lumen of the mature animal and
deposited as a constituent component of the egg white into eggs laid by the animal. It is
also contemplated to be within the scope of the present invention for the heterologous
polypeptides to be produced in the serum of a transgenic avian.

A leaky promoter such as the CMV promoter may be operably linked to a transgene,
resulting in expression of the transgene in all tissues of the transgenic avian and thus in
production of, for example, immunoglobulin polypeptides in the serum. Alternatively, the
transgene may be operably linked to an avian promoter that may express the transgene in a
restricted range of tissues such as, for example, oviduct cells and macrophages so that the
heterologous protein may be identified in the egg white or the serum of a transgenic avian.
Transgenic avians produced by the sperm-mediated transfection methods of the present
invention will have the ability to lay eggs that contain one or more desired heterologous
protein(s) or variant thereof.

One embodiment of the present invention, therefore, is a transgenic avian produced
by the sperm-mediated transfection methods of the present invention and having a
heterologous polynucleotide sequence comprising a nucleic acid insert encoding a
heterologous polypeptide and operably linked to an avian lysozyme gene expression control
region, the gene expression control region comprising at least one 5' matrix attachment
region, an intrinsically curved DNA region, at least one transcription enhancer, a negative
regulatory element, at least one hormone responsive element, at least one avian CR1 repeat
element, and a proximal lysozyme promoter and signal peptide-encoding region.

Another embodiment of the present invention provides a transgenic avian further 
comprising a transgene with a lysozyme 3' domain. 

Accordingly, the invention provides transgenic avians produced by methods of the
invention as described infra. In preferred embodiments, the transgenic avian contains a
transgene comprising a heterologous peptide coding sequence operably linked to a promoter
and, in certain embodiments, other regulatory elements. In more preferred embodiments,
the transgenic avians of the invention produce heterologous proteins, preferably in a
tissue-specific manner, more preferably such that they are deposited in the serum and, most
preferably, such that the heterologous protein is deposited into the egg, particularly in the
egg white. In preferred embodiments, the transgenic avians produce eggs containing greater
than 5 µg, 10 µg, 50 µg, 100 µg, 250 µg, 500 µg, or 750 µg, more preferably greater than 1
mg, 2 mg, 5 mg, 10 mg, 20 mg, 50 mg, 100 mg, 200 mg, 500 mg, 700 mg, 1 gram, 2 grams,
3 grams, 4 grams or 5 grams of the heterologous protein. In preferred embodiments, the
transgenic avians produce an immunoglobulin molecule and deposit the immunoglobulin in
the egg or serum of the avian, and preferably, the immunoglobulin isolated from the egg or
serum specifically binds its cognate antigen. The antibody so produced may bind the
antigen with the same, greater or lesser affinity than the antibody produced in a mammalian
cell, such as a myeloma or CHO cell.

In specific embodiments, the transgenic avians of the invention were not produced
or are not progeny of a transgenic ancestor produced using a eukaryotic viral vector, more
particularly, not a retroviral vector (although, in certain embodiments, the vector may
contain sequences derived from a eukaryotic viral vector, such as promoters, origins of
replication, etc.). The transgenic avians of the invention include G0 avians, founder
transgenic avians, G1 transgenic avians, avians containing the transgene in the sperm or
ova, avians mosaic for the transgene and avians containing copies of the transgene in most
or all of the cells. Contemplated by the invention are transgenic avians in which the
transgene is episomal. In more preferred embodiments, the transgenic avians have the
transgene integrated into one or more chromosomes. Chromosomal integration can be
detected using a variety of methods well known in the art, such as, but not limited to,
Southern blotting, PCR, etc.

6. EXAMPLES

The present invention is further illustrated by the following examples. Each
example is provided by way of explanation of the invention, and is not intended to be a
limitation of the invention. In fact, it will be apparent to those skilled in the art that various
modifications, combinations, additions, deletions and variations can be made in the present
invention without departing from the scope or spirit of the invention. For instance, features
illustrated or described as part of one embodiment can be used in another embodiment to
yield a still further embodiment. It is intended that the present invention covers such
modifications, combinations, additions, deletions and variations as come within the scope of
the appended claims and their equivalents.

All references cited herein are incorporated herein by reference in their entirety and
for all purposes to the same extent as if each individual publication, patent or patent
application was specifically and individually indicated to be incorporated by reference in its
entirety for all purposes. The citation of any publication is for its disclosure prior to the
filing date and should not be construed as an admission that the present invention is not
entitled to antedate such publication by virtue of prior invention.

6.1 Example 1: Vectors Having Sperm-Specific Reporter Genes

The specific activity of spermatogenesis-specific promoters, such as the protamine
promoter necessary for post-meiotic-specific transcription of this gene, may be used to
selectively mark those sperm cells that have inherited the transgene of interest after meiotic
segregation.

The construct contains two separate elements. In one example, the first element
comprises an oviduct-specific promoter, such as that associated with a gene encoding
ovalbumin, lysozyme, ovomucoid, ovotransferrin, conalbumin or ovomucin. The promoter
is operatively linked to, and therefore drives the expression of, a gene coding for a desired
heterologous protein of interest, such as, but not limited to, a therapeutic protein like
interferon, erythropoietin (EPO), or an immunoglobulin.

The second element, which can be located either upstream or downstream from the
first element, contains the protamine promoter, or any fragment thereof that is sufficient to
drive the expression of a marker gene encoding a vital and color marker, such as the Green
Fluorescent Protein (GFP). Those sperm cells that incorporate the transgene into their
genomic DNA are vitally labeled during the late stages of spermiogenesis by the expression
of the GFP protein. Given that the construct contains both the above first and the second
elements, positive sperm cells also contain the transgene of interest.

Large numbers of positive sperm cells expressing the GFP protein are isolated using
Fluorescent Activated Cell Sorting (FACS). Sperm cells selected on the basis of the
expression of the incorporated marker gene are then used to breed hens by artificial
insemination protocols. Suitable avian insemination protocols have been described by
Etches (1996) Reprod. in Poultry (CAB International, Wallingford, UK), incorporated
herein by reference in its entirety. In those cases where the number of positive sperm
obtained after FACS isolation is too low for the likelihood of successful artificial
insemination, the females may be fertilized by the intramagnal insemination method of
Engel (1991) Poult. Sci. 70:1965-1969 or Trefil (1996) Br. Poult. Sci. 37:661-664,
incorporated herein by reference in their entireties. Alternatively, small numbers of positive
sperm cells are isolated under a microscope using UV light and then microinjected into
unfertilized eggs via the Intracytoplasmic Sperm Injection (ICSI) protocols of Perry (1999),
incorporated herein by reference in its entirety.

6.2 Example 2: Lipofection Gene Transfer to Avian Oocytes

(a) Isolation of the ovum: Donor hens were inseminated using the protocol for
avian artificial insemination described by Etches (1996), incorporated herein by reference in
its entirety. Fertilized ova were collected from the magnum region of the oviduct of
euthanized birds 1.5-3 hours after oviposition. Alternatively, a hen whose oviduct is
fistulated allows the collection of eggs for enucleation as taught by Gilbert and Wood-Gush
(1963, J. Reprod. Fertility 5: 451-453) and Pancer et al. (1989, Br. Poult. Sci. 30: 953-7).
The thick albumen capsule surrounding the ovum was removed using spatulas and the ovum
was placed in a well 48 mm in diameter and 23 mm in height containing Perry's salt solution
(see Perry (1988), incorporated herein by reference in its entirety).

(b) Preparation of lipofection solutions: Two lipofection solutions were used. The
first solution comprised 50 µg/ml of LIPOFECTAMINE™ (Gibco) pre-incubated for 1 hour
with the restriction endonuclease Not I (500 Units Not I per ml of lipofection solution), and
is designated herein as "Lipofectamine/Not I solution". The second lipofection solution was
composed of 50 µg/ml of LIPOFECTAMINE™ pre-incubated for 1 hour with 500 µg of
peGFP linearized with Not I per ml of lipofection solution, herein described as
"Lipofectamine/peGFP solution." Lipofectin-treated eggs were then incubated for 1 hour.
(c) Gene transfer to avian oocytes by lipofection: The isolated ovum was then
placed inside a glass conical chamber (Figure 1A) so that the blastodisc was located in the
center of a window that opens at the narrower end of the conical chamber. A 40 mm
diameter and 8 mm high glass dish was used at the bottom of the cone to close the system.
Perry's salt solution was added to the bottom of the dish to prevent drying of the lower half of
the ovum. The Perry's salt solution overlaying the blastodisc (accessed through the window
opening of the cone) was then replaced by, for example, 100 µl of a lipofection solution
described above. The eggs were incubated for 1 hour. Alternatively, egg incubation can be
done by adding the lipofection solutions to the well and inverting the position of the
incubation chamber (Figure 1B), or by using a cloning cylinder around the blastodisc
(Figure 1C).

(d) Transfer of the lipofected egg: In a preferred embodiment, the ovum is
surgically transferred into the oviduct of the recipient hen shortly after lipofection according
to a described surgical procedure (Tanaka, 1994, supra). The recipient hens are
anesthetized shortly after laying by wing vein injection with pentobarbital (0.7 ml of a
68 mg/ml solution) or with a gas anesthetic such as Isoflurane. During this window, the
infundibulum is receptive to receiving a donor ovum but the hen has not yet ovulated. Feathers
are removed from the abdominal area, and the area is scrubbed with betadine and rinsed with
70% ethanol. The bird is placed in a supine position and a surgical drape is placed over the
bird, exposing the surgical area. An incision is made beginning at the junction of the sternal
rib to the breastbone and running parallel to the breastbone. The length of the incision is
approximately 6 cm. After cutting through the smooth muscle layers and the peritoneum,
the infundibulum is located. The infundibulum is externalized and opened using gloved
hands. The donor ovum is gently placed in the open infundibulum. Gravity facilitates the
movement of the ovum through the infundibulum and into the anterior magnum. The
internalized ovum is placed into the body cavity and the incision is closed using interlocking
stitches both for the smooth muscle layer and the skin. The recipient hen is returned to her
cage and allowed to recover with free access to both feed and water. The hens resume
normal activities after a post-operative recovery time of less than 45 minutes. Once
transferred, the embryo develops inside the recipient hen and travels through the oviduct,
where it is encapsulated by natural egg white proteins and a natural eggshell. Eggs laid by
the recipient hens are collected the next day, set, and incubated in a Jamesway incubator.
The eggs hatch 21 days later.

6.3 Example 3: Maintenance of Plasmid Linearization in the REMI Procedure

A plasmid that is to be integrated into the genomic nucleic acid of a sperm is
linearized by cleavage with a selected restriction endonuclease. The linearized nucleic acid
is then dephosphorylated at the exposed 5' ends of the newly formed cohesive regions by
alkaline phosphatase treatment. Suitable protocols for the alkaline phosphatase
dephosphorylation of nucleic acids are disclosed, for example, by Sambrook et al. (supra),
incorporated herein by reference in its entirety.

While not wishing to be bound by any one theory, it is believed that
dephosphorylated cohesive ends of the nucleic acid may hybridize to recircularize the
cleaved plasmid. Dephosphorylation of the 5' termini, however, prevents a DNA ligase from
covalently rejoining a 5' terminus to the adjacent 3' terminus, thereby preventing a stable
circular plasmid molecule from reforming. The cohesive ends of the non-ligated
circularized plasmid may dissociate within a sperm cell to give a linearized nucleic acid that
may integrate into the sperm genomic DNA.

Alternatively, a circular plasmid having a heterologous nucleic acid that is to be
integrated into the genomic nucleic acid of a sperm is digested with at least two different
restriction endonucleases that generate a linearized plasmid having two non-cohesive ends,
and wherein the desired heterologous transgenic element remains intact
between the new termini of the cleaved plasmid. The restriction endonucleases are selected
to give dissimilar cohesive ends that cannot hybridize together to recircularize the cleaved
plasmid. The linearized nucleic acid is then delivered to the sperm with both of the
restriction endonucleases used to cleave the plasmid. The restriction endonucleases may be
delivered to the sperm sequentially or simultaneously, and may be combined with, or
delivered sequentially to, the cleaved plasmid.

It can be advantageous, depending upon the positions of the endonuclease cleavage
sites within the plasmid relative to the desired transgene, to use two different endonucleases
that produce hybridizable cohesive ends. In this case, the 5' termini may also be
dephosphorylated with alkaline phosphatase as described above, to prevent religation and
stabilization of the cleaved plasmid.
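
The choice between dissimilar ends and dephosphorylated hybridizable ends can be sketched in code. The following is a minimal illustration only, assuming that both enzymes leave 5' overhangs and that two such overhangs can hybridize when one is the reverse complement of the other. The enzyme table is a small hand-entered sample chosen for illustration; it is not a list of the enzymes used in the examples herein, and the function names are hypothetical.

```python
# Minimal sketch: decide whether two 5' overhangs can hybridize, and hence
# whether alkaline phosphatase treatment is advisable before REMI.
# The overhang table is an illustrative sample, not taken from the examples.

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

# 5' single-stranded overhangs, written 5'->3', with the recognition site
# and cut position (^) shown in the comment.
FIVE_PRIME_OVERHANGS = {
    "NotI": "GGCC",   # GC^GGCCGC
    "SalI": "TCGA",   # G^TCGAC
    "XhoI": "TCGA",   # C^TCGAG
    "EcoRI": "AATT",  # G^AATTC
}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def ends_can_hybridize(enzyme_a: str, enzyme_b: str) -> bool:
    """True if the overhangs left by the two enzymes can base-pair."""
    overhang_a = FIVE_PRIME_OVERHANGS[enzyme_a]
    overhang_b = FIVE_PRIME_OVERHANGS[enzyme_b]
    return overhang_a == reverse_complement(overhang_b)


def dephosphorylation_advised(enzyme_a: str, enzyme_b: str) -> bool:
    """Hybridizable ends could recircularize, so phosphatase treatment is advised."""
    return ends_can_hybridize(enzyme_a, enzyme_b)


if __name__ == "__main__":
    for pair in [("NotI", "EcoRI"), ("SalI", "XhoI"), ("NotI", "NotI")]:
        print(pair, "dephosphorylate:", dephosphorylation_advised(*pair))
```

Under these assumptions, a pair such as Not I and EcoR I gives dissimilar ends that cannot recircularize the plasmid, whereas a pair such as Sal I and Xho I gives hybridizable ends for which dephosphorylation would be indicated.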

6.4 Example 4: Methods for Determining the SV40 Ori Requirement in SMT

To determine the requirement for the SV40 origin of replication in sperm-mediated
transgenesis, 5 µg each of the plasmids p1083 (with the CMV promoter controlling heavy
chain transcription) and p1086 (where the CMV promoter controls light chain transcription)
were digested with Dra III, which excises the SV40 origin of replication from the p1083
plasmid while retaining the SV40 origin of replication of the p1086 plasmid. For
comparison, 5 µg each of the plasmids p1083 and p1086 were digested with the restriction
endonuclease Mlu I, which linearizes both plasmids while retaining the SV40 origin of
replication in each of the respective plasmids.

Digested plasmids were used to transfect sperm. In a polystyrene tube, Dra III-
digested plasmids p1086 and p1083 (5 µg of each) were added to 100 µl of OPTIMEM™
medium (Life Technologies, Gaithersburg, MD) and 10 µg of LIPOFECTAMINE™
liposome (Life Technologies, Gaithersburg, MD). In a separate tube, 100 units of Dra III
restriction enzyme were added to 100 µl of OPTIMEM™ medium followed by 10 µg of
LIPOFECTAMINE™. The tubes were incubated at room temperature for 30 minutes, then
added to freshly collected semen containing 10^9 chicken sperm (approximately 300 µl of
semen). The sperm, DNA-liposome, and restriction enzyme-liposome mixture was
incubated at room temperature for 30 minutes.

Two White Leghorn hens were then artificially inseminated with 250 µl each of the
transfection mixture. Eggs were collected for 7 days starting on the second day after
fertilization, and set for hatch. Two weeks after hatch, serum samples were collected and
assayed for human monoclonal antibodies by ELISA. The results are shown in Figure 10,
wherein wing band number 3932 is the control.

6.5 Example 5: Gamma Irradiation of Chicken Sperm

Exogenous, linearized DNA can be integrated into the genome of a recipient sperm
cell by cleaving the double-stranded genomic DNA by gamma irradiation of the sperm prior
to lipofection thereof with the transgenic nucleic acid.

Wooster et al. found that rooster sperm irradiated with 12 Grays (Gy) of gamma
irradiation retained about 43% residual fertility (1977, Can. J. Genet. Cytol. 19: 437-
446). Therefore, rooster semen will be irradiated with the following doses of gamma
radiation: 0, 1, 5, 10, 15, and 20 Gy. A liposomal complex consisting of 10 µg of
linearized DNA containing a promoter (e.g., CMV, ovalbumin, lysozyme, ovomucoid,
ovotransferrin, conalbumin, and ovomucin, etc.) and transgene (e.g., IFN, erythropoietin,
human monoclonal antibody immunoglobulin heavy and light chains, and GM-CSF, etc.)
and 10 µg of LIPOFECTAMINE™ (Life Technologies, Gaithersburg, MD) will then be
transfected into the irradiated sperm. After one hour, the irradiated and transfected sperm
will be introduced into the hen by traditional artificial insemination procedures. Resulting
laid eggs will be set and hatched, and transgene integration will be confirmed by Southern
analysis of blood DNA.

6.6 Example 6: Ovum Transfer to a Laying Hen

At the time of laying, recipient hens are anesthetized by wing vein injection with
pentobarbital (0.7 ml of a 68 mg/ml solution) or by a gaseous anesthetic such as Isoflurane.
Pentobarbital is the preferred anesthetic. At this time, the infundibulum is receptive to
receiving a donor ovum but the hen has not yet ovulated. Feathers are removed from the
abdominal area, and the area is scrubbed with betadine and rinsed with 70% ethanol. The bird is
placed in a supine position and a surgical drape is placed over the bird with the surgical area
exposed. An incision is made beginning at the junction of the sternal rib to the breastbone
and running parallel to the breastbone. The length of the incision is approximately two
inches. After cutting through the smooth muscle layers and the peritoneum, the
infundibulum is located. The infundibulum is externalized and opened using gloved hands
and the donor ovum is gently applied to the open infundibulum. The ovum is allowed to
move into the infundibulum and into the anterior magnum by gravity feed. The internalized
ovum is placed into the body cavity and the incision is closed using interlocking stitches both
for the smooth muscle layer and the skin. The recipient hen is returned to her cage and
allowed to recover with free access to both feed and water. The bird is usually up, moving
and feeding within 45 minutes of the operation's end. Eggs laid by
the recipient hens are collected the next day, set, and incubated. They will hatch 21 days
later.

6.7 Example 7: Generation of Transgenic Chickens by Sperm-Mediated
Transfection of Heterologous Nucleic Acid

Plasmid pRC/CMV-EGFP, 10 µg, was added to 100 µl of OPTIMEM™ medium
(Life Technologies, Gaithersburg, MD) and 10 µg of LIPOFECTAMINE™ (Life
Technologies, Gaithersburg, MD) liposomes, in a polystyrene tube. In a separate tube, 100
units of Dra III restriction enzyme were added to 100 µl of OPTIMEM™ medium followed
by 10 µg of LIPOFECTAMINE™. As negative controls, plasmids p1086 and p1083 were
substituted for pRC/CMV-EGFP in the transfection mixture. Tubes were incubated at room
temperature for 30 minutes, then added to 10^9 freshly collected chicken sperm
(approximately 300 µl of semen). The sperm, DNA-liposome, and restriction enzyme-
liposome mixture was incubated at room temperature for 30 minutes.

Two White Leghorn hens were inseminated with the transfection mixture, each hen
receiving approximately 250 µl of the transfection mixture. Eggs were collected for 7 days
starting on the second day after fertilization, and set for hatch.
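
The volumes in this example can be checked with simple arithmetic. The sketch below is illustrative only: it assumes that each liposome tube contributes about 100 µl, that the semen contributes about 300 µl, and that the small volumes of plasmid, enzyme and liposome stock are negligible. The function and variable names are hypothetical.

```python
# Minimal sketch of the volume bookkeeping for the transfection mixture.
# Assumes the added plasmid, enzyme and liposome stocks have negligible volume.

DNA_LIPOSOME_TUBE_UL = 100.0      # OPTIMEM with plasmid and liposomes
ENZYME_LIPOSOME_TUBE_UL = 100.0   # OPTIMEM with Dra III and liposomes
SEMEN_UL = 300.0                  # approximate semen volume
TOTAL_SPERM = 1e9                 # sperm in the collected semen
NUMBER_OF_HENS = 2


def insemination_plan(hens: int = NUMBER_OF_HENS) -> dict:
    """Split the whole transfection mixture evenly between the hens."""
    total_ul = DNA_LIPOSOME_TUBE_UL + ENZYME_LIPOSOME_TUBE_UL + SEMEN_UL
    return {
        "total_volume_ul": total_ul,
        "volume_per_hen_ul": total_ul / hens,
        "sperm_per_hen": TOTAL_SPERM / hens,
    }


if __name__ == "__main__":
    plan = insemination_plan()
    print(plan)
```

Under these assumptions the mixture totals about 500 µl, so dividing it between two hens gives the approximately 250 µl per hen stated above, corresponding to about 5 x 10^8 sperm per hen.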

Four days after hatching, blood drops from chicks were collected from leg veins
with heparinized capillary tubes and placed on microscope slides. Blood smears were
viewed with FITC illumination with an inverted microscope (Olympus IX70, 100 watt
mercury lamp, HQ-FITC Band Pass Emission filter cube, excitation 480/40 nm, emission
535/50 nm, and 20X phase contrast objective). Auto-fluorescence was assessed using a
TRITC filter (Olympus Modular B-MAX Filter cube, excitation 535/50 nm, emission
610/75 nm).
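
The filter specifications can be expanded into wavelength ranges. The sketch below is illustrative only and assumes that a value such as 480/40 nm denotes a center wavelength of 480 nm with a full bandwidth of 40 nm, a common convention for band pass filters that is not stated in this example. The names used are hypothetical.

```python
# Minimal sketch: convert "center/bandwidth" filter notation into pass bands.
# Assumes "480/40" means center 480 nm and full bandwidth 40 nm.


def passband(center_nm: float, bandwidth_nm: float) -> tuple:
    """Return the (lower, upper) edges of a band pass filter in nm."""
    half = bandwidth_nm / 2
    return (center_nm - half, center_nm + half)


def bands_overlap(band_a: tuple, band_b: tuple) -> bool:
    """True if two wavelength ranges share any interval."""
    return band_a[0] < band_b[1] and band_b[0] < band_a[1]


FITC = {"excitation": passband(480, 40), "emission": passband(535, 50)}
TRITC = {"excitation": passband(535, 50), "emission": passband(610, 75)}

if __name__ == "__main__":
    print("FITC:", FITC)
    print("TRITC:", TRITC)
    print("emission bands overlap:",
          bands_overlap(FITC["emission"], TRITC["emission"]))
```

Under this assumption the two emission windows do not overlap, so a signal seen only in the FITC cube and not in the TRITC cube is unlikely to be broad-spectrum auto-fluorescence.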

Two chicks that resulted from sperm transfected with pRC/CMV-EGFP had white
blood cells showing green fluorescence. No fluorescence was seen when viewed with the
TRITC filter, indicating that the green fluorescence was not due to auto-fluorescence. None
of the control chicks, derived from sperm transfected with control plasmids, had green
fluorescence in their blood.



6.8 Example 8: Sperm-Mediated Transfection of Japanese Quail Ova

Prophetic Example

Japanese Quail hens will be artificially inseminated with sperm transfected with
vectors capable of expressing α-IFN, erythropoietin or a monoclonal antibody. ELISAs will
be used to detect and measure the amount of an expressed transgene product in the animal's
serum and egg. As little as 15 pg of α-interferon (α-IFN) or erythropoietin can be detected
by this procedure.

To prepare the quail flock for artificial insemination, the females will be separated
from the males. Once the isolated females are no longer laying fertile eggs and the males
are consistently producing sufficient semen, the birds will be used for artificial insemination
(AI) procedures.

Sperm-mediated transgenesis (SMT) of the quail will be performed with two
plasmid vectors, pRC/CMV-IFNMM-SV40 and pRC/CMV-EPOMM-SV40. Transgenesis
resulting in the integration of, and expression from, a heterologous nucleic acid encoding
α-IFN has been used successfully in chickens with both viral-based and sperm-mediated
transfer (SMT)-based systems. The second vector will carry the gene encoding
erythropoietin. This protein requires more extensive post-translational modification, i.e.,
four glycosylations, than does α-IFN. Both of the plasmid vectors will produce their
respective expressed polypeptides in serum and in ovo. Assaying for α-IFN or EPO
production in serum will begin at two weeks of age, and egg production will occur shortly
thereafter. SMT will also be performed with vectors having immunoglobulin heavy and light
chains under the expression control of a lysozyme promoter.

About 50 chicks will be obtained from the SMT-AIs. Based on results from our
chicken SMT experiments, at least 2 to 4 transgenic quail for every 50 birds will be
produced from the SMT-AIs.

6.9 Example 9: Preparation of Female and Male Japanese Quails for 
Sperm-Mediated Transfection by Artificial Insemination 

The birds used will be selected for their optimal age for fertility, according to the
average life history of the quail as shown in Table 1.

Table 1: Japanese Quail life history

Stage                              Age or duration
---------------------------------  ------------------------
Hatching                           16-17 days
Sexual maturity
  Females
    Under current conditions       48 days
    Under optimal conditions       35-38 days
  Males                            35-42 days
Optimal fertility
  Females                          60-240 days (8-34 weeks)
  Males                            60-280 days (8-40 weeks)
Declining fertility
  Fertility declines 30-50%        224 days (32 weeks)
Gamete production cessation
  Females (eggs)                   [illegible] years
  Males
    Decreases                      [illegible]
    Ceases                         3 years
Lifespan
  Females                          2.5-3 years
  Males                            3-5 years
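
Selection of birds by age can be sketched as a simple range check. The following is an illustration only, using the optimal fertility windows given in Table 1 (60-240 days for females and 60-280 days for males); the function name and data layout are hypothetical.

```python
# Minimal sketch: check whether a quail falls within the optimal fertility
# window listed in Table 1. Names and structure are illustrative only.

OPTIMAL_FERTILITY_DAYS = {
    "female": (60, 240),
    "male": (60, 280),
}


def in_optimal_window(sex: str, age_days: int) -> bool:
    """True if a bird of the given sex and age is within the optimal window."""
    first_day, last_day = OPTIMAL_FERTILITY_DAYS[sex]
    return first_day <= age_days <= last_day


if __name__ == "__main__":
    flock = [("female", 45), ("female", 120), ("female", 250), ("male", 250)]
    for sex, age in flock:
        print(sex, age, "days ->", in_optimal_window(sex, age))
```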



Females: Female Japanese Quail are separated from males and eggs inspected for 
fertilization over a 10 day period. Females will begin producing infertile eggs about 7 to 10 
days after removal from the males. 

Males: Male birds will be separated from females and conditioned for semen
collection. The conditioning will be continued for 10 days. About 60% of males at their
sexual peak will produce good semen, and birds that consistently produce a high volume of
semen will be progressively selected.

(a) Semen Quality and Lipofection Optimization: The extracted quail sperm will be
added to a diluent that is at a higher pH than typically used with chicken sperm. Quail
sperm, compared to chicken sperm, require a higher pH to maintain motility once collected
from the animal, as reported by Holm, L. & Wishart, G.J. in Animal Reprod. Sci. 54: 45-54
(1998) and incorporated herein by reference in its entirety. A semen diluent having a pH of
between about 8 and about 9 maintains motility better than does a pH of 7.
Artificial insemination (AI): Each hen will be artificially inseminated with a 25 µl dose
containing 2.5 x 10^7 sperm. Hens will be divided into Group 1: 4 females
inseminated with semen only; Group 2: 4 females inseminated with semen treated with
LIPOFECTAMINE™; Group 3: 4 females sperm-mediated transfected with pCMV-IFN-
SV40; Group 4: 4 females sperm-mediated transfected with pCMV-EPO-SV40.
Since hens remain fertile for about 4 days on average after artificial insemination, the hens will
be inseminated twice a week to ensure delivery of a fresh supply of transfected semen.
Transgenic positive birds will be mated to produce G1 chicks. The first three eggs of each
bird will be screened for IFN, EPO or immunoglobulin polypeptide expression and the
remaining eggs will be incubated to hatching.
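
The weekly semen requirement implied by this schedule can be estimated as follows. This is a rough sketch only, assuming four groups of four hens, two inseminations per hen per week, and one 25 µl dose of 2.5 x 10^7 sperm per insemination, with no allowance for losses during handling. The names are hypothetical.

```python
# Minimal sketch: weekly dose bookkeeping for the four insemination groups.
# Assumes no handling losses; all names are illustrative.

DOSE_VOLUME_UL = 25.0
SPERM_PER_DOSE = 2.5e7
HENS_PER_GROUP = 4
NUMBER_OF_GROUPS = 4
INSEMINATIONS_PER_WEEK = 2


def weekly_requirement() -> dict:
    """Return doses, diluted semen volume and sperm needed per week."""
    hens = HENS_PER_GROUP * NUMBER_OF_GROUPS
    doses = hens * INSEMINATIONS_PER_WEEK
    return {
        "sperm_per_ul": SPERM_PER_DOSE / DOSE_VOLUME_UL,
        "doses_per_week": doses,
        "volume_per_week_ul": doses * DOSE_VOLUME_UL,
        "sperm_per_week": doses * SPERM_PER_DOSE,
    }


if __name__ == "__main__":
    print(weekly_requirement())
```

Under these assumptions the dose corresponds to 10^6 sperm per µl, and the 16 hens require 32 doses, or about 800 µl of diluted semen containing 8 x 10^8 sperm, each week.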

(b) Hatchling care: Newly hatched chicks will be grown at 105°F-110°F for the
first 4-5 days. The temperature will then be reduced by 5°F after the first and second week
in 2°F increments. By the third week the house temperature will be sufficient. A 16/8
lighting schedule will also be used.

6.10 Example 10: Quail Semen Collection 

The male bird is grasped so that its breastbone rests in the palm of the right hand. The
tail is positioned so that the first two fingers of the right hand lie on either side of the vent just
below the legs. Holding the male in an almost vertical position, the left hand gently
squeezes four times at the base of the cloaca to remove the foamy secretions of the glandula
proctodealis (foam gland). The vent is wiped to remove traces of the foamy substance and
to prevent contamination of the semen. The left hand maintains firm pressure against the
base of the cloaca and gently pulls back on the dorsal proctodeal wall to achieve erection.

The first two fingers of the right hand gently massage the abdomen and apply
moderate pressure just below the vent to force semen from the vas deferens into the
copulatory organ. The semen will appear shortly thereafter. The viscous, pale yellow to
white semen is collected with a 20 µl pipette and immediately diluted with 150 mM NaCl
and 20 mM N-tris[hydroxymethyl]methyl-2-aminoethanesulfonic acid (TES), at pH 8.0.

6.11 Example 11: Lipofection of Quail Sperm 

Quail semen will be diluted, immediately after harvesting, to a concentration of 10^8
sperm/ml in 150 mM NaCl and 20 mM N-tris[hydroxymethyl]methyl-2-aminoethane-
sulfonic acid (TES), pH 8.0 buffer. Semen extender that is optimized for chicken sperm
should not be used, since it immobilizes quail sperm within five minutes of contact.

The lipofection procedures used with quail sperm will be similar to those adopted
for chicken lipofection, including REMI sperm-mediated transfections (SMT). With the
chicken SMT procedure, artificial insemination is with approximately 6 x 10^8 sperm. Due
to the limited amount of semen produced by male quail, 1 x 10^8 quail sperm will be used per
hen. The lowest number of sperm that still gives maximum insemination will be
adopted. Typically, the DNA (1.0 µg), restriction enzyme, LIPOFECTAMINE™ (1.0 µg)
and sperm (10^8) will be incubated together at a ratio of / respectively for 30 minutes. All
reactions will be carried out in OptiMEM™ medium (Gibco-BRL, Gaithersburg, MD).
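
The dilution to a target sperm concentration follows the usual relation C1 x V1 = C2 x V2. The sketch below is illustrative only; the stock concentration of freshly collected quail semen is not given in this example, so the value used in the demonstration is a hypothetical placeholder, as are the function and parameter names.

```python
# Minimal sketch: volume of TES/NaCl diluent needed to reach a target sperm
# concentration, from C1 * V1 = C2 * V2. The stock concentration used in the
# demonstration is hypothetical and would have to be measured.

TARGET_SPERM_PER_ML = 1e8


def diluent_volume_ul(semen_volume_ul: float,
                      stock_sperm_per_ml: float,
                      target_sperm_per_ml: float = TARGET_SPERM_PER_ML) -> float:
    """Return the volume of diluent (in µl) to add to the collected semen."""
    if stock_sperm_per_ml < target_sperm_per_ml:
        raise ValueError("stock is already below the target concentration")
    final_volume_ul = semen_volume_ul * stock_sperm_per_ml / target_sperm_per_ml
    return final_volume_ul - semen_volume_ul


if __name__ == "__main__":
    # Hypothetical: 20 µl of semen counted at 1e9 sperm/ml.
    print(diluent_volume_ul(20.0, 1e9), "µl of diluent")
```

With the hypothetical stock of 10^9 sperm/ml, 20 µl of semen would be brought to 200 µl, that is, 180 µl of diluent would be added.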

6.12 Example 12: Integration of Adeno-Associated Virus (AAV) Inverted Terminal
Repeat-Flanked Genes Introduced by Sperm-Mediated Transgenesis

The chromosomal integration of plasmid DNA into the genome of an avian cell will
be mediated by flanking the gene of interest, and sequences related to its expression, with
AAV inverted terminal repeat (ITR) sequences. A method for gene delivery and integration
of heterologous nucleic acid sequences into the genomic DNA of a mammalian cell is
described by Solis et al. in U.S. Patent No. 5,843,742, incorporated herein by
reference in its entirety. A nucleic acid segment will also be included with the gene of
interest that will result in the expression of the AAV Rep protein within the same cell.

For example, a plasmid nucleic acid vector containing an expression cassette
consisting of a CMV immediate early promoter driving the expression of human
erythropoietin will be flanked by AAV ITR sequences. This plasmid will be introduced by
sperm-mediated transgenesis into targeted host cells together with a second nucleic acid
vector plasmid. This second plasmid will include an expression cassette comprising the
CMV immediate early promoter driving expression of the nucleic acid sequence encoding
the AAV Rep 78 protein. Alternatively, a single nucleic acid vector comprising the
expression cassette comprising the CMV immediate early promoter driving expression of
the nucleic acid sequence encoding the AAV Rep 78 protein and the cassette expressing the
gene of interest, such as erythropoietin, will be introduced together into an avian male germ
cell.

6.13 Example 13: DNA Construct Modification to Improve Germline
Transmission of Transgenes

Following genetic modification in vertebrates, a low percentage of offspring
derived from the founder animals are transgenic, given the low number of germline cells that
carry the transgene. As a result, costly and cumbersome breeding of the founder animals is
required to expand the number of transgenic animals derived from the original founder
animals.

A number of articles (e.g., Peschon, 1989, Ann. NY Acad. Sci. 564: 186-197;
Peschon, 1987, PNAS 84: 5316-5319; Zambrowicz, 1993, PNAS 90: 5071; Braun, 1989,
Genes Dev. 3:793-802; Rhim, 1995, Biol. Reprod. 52:20-32) as well as patent application(s)
(O'Gorman et al., PCT Publication No. WO 99/10488) have identified and used the
elements of the protamine promoter necessary for post-meiotic-specific transcription of this
gene. Other spermiogenesis-specific promoters have also been described and used in the
context of genetic manipulation (Sage, 1999, Mech. Dev. 80: 29-39; Vidal, 1998, Mol.
Reprod. Dev. 51: 274-280). In this example, we take advantage of the specific activity of
these promoters to selectively mark those sperm cells that have inherited the transgene of
interest after meiotic segregation.




WO 03/024199 



PCT/US02/30156 



In the example described here, the construct would contain two independent
elements. In a preferred example, the first element would comprise an oviduct-specific
promoter, such as that of ovalbumin, lysozyme, ovomucoid, ovotransferrin, conalbumin, or
ovomucin. The promoter would drive expression of a gene coding for a protein of interest,
such as a therapeutic protein like interferon or erythropoietin (EPO). Alternatively,
constitutive promoters such as CMV or RSV may also be used.

The second element, located upstream or downstream from the first, would contain the
protamine promoter, or a segment of this promoter that is sufficient to drive the expression
of a marker gene. In a preferred example, the protamine promoter would drive the
expression of a marker, preferably a vital and color marker, such as the Green Fluorescent
Protein (GFP). In such an example, those sperm cells that have inherited the transgene would
be vitally labeled during the late stages of spermiogenesis with the expression of the GFP
protein. Given that the construct used contains both the first and the second elements
described above, positive sperm cells would also contain the transgene of interest.

Large numbers of positive sperm cells expressing the GFP protein could be isolated
using Fluorescent Activated Cell Sorting (FACS). These sperm cells could subsequently be
used to breed hens by described artificial insemination protocols (Etches, 1996, Mol.
Reprod. Dev. 45:2918). In cases where the number of positive sperm after FACS isolation
is low and insufficient for AI, the females could be bred through intramagnal insemination
(Engel, 1991, Poultry Sci. 70: 1965; Trefil, 1996, Br. Poult. Sci. 37: 661-664).
Alternatively, small numbers of positive sperm cells could be isolated under a microscope
using UV light and injected into unfertilized eggs via described Intracytoplasmic Sperm
Injection (ICSI) protocols (Perry, 1999, Science 284: 1180-83).

6.14 Example 14: Use of Chicken Centromeric and Telomeric Sequences to
Create a Chicken Artificial Chromosome (ChAC)

The Shemesh et al. procedure (2000, Molecular Reproduction and Development 56:
306-308) for introducing linearized plasmid DNA into chicken sperm appears to rely on
vector sequences which include an SV40 origin of replication. It is possible that the
exogenous DNA therefore replicates as an episome and would most likely be lost in
subsequent cell divisions due to improper segregation at mitosis. To ensure proper
segregation at mitosis, chicken centromere and telomere sequences could be included in the
transgenic construct. Chicken centromere and telomere sequences could be obtained on a
BAC (bacterial artificial chromosome) library clone from Texas A&M University or Martin
Groenen at Wageningen Agricultural University, The Netherlands. The SV40 origin of
replication and the promoter (e.g., CMV, ovalbumin, lysozyme, ovomucoid, ovotransferrin,
conalbumin, and ovomucin, etc.) and transgene (e.g., IFN, EPO, human monoclonal antibody
heavy and light chains, GM-CSF, etc.) combination could be cloned into the BAC clone
containing the chicken centromere. This BAC would therefore contain an origin of
replication, a centromere, telomeres, and the promoter/transgene combination, and could be
transfected into sperm with the Shemesh procedure. Due to the chicken centromere and
telomeres, the construct would replicate and segregate as a chicken artificial chromosome
(ChAC).






6.15 Example 15: Construction of Lysozyme Promoter Plasmids 

The chicken lysozyme gene expression control region was isolated by PCR
amplification. Ligation and reamplification of the fragments thereby obtained yielded a
contiguous nucleic acid construct comprising the chicken lysozyme gene expression control
region operably linked to a nucleic acid sequence optimized for codon usage in the chicken
(SEQ ID NO: 5) and encoding a human interferon α2b polypeptide optimized for expression
in an avian cell.

White Leghorn Chicken (Gallus gallus) genomic DNA was PCR amplified using the
primers 5pLMAR2 (SEQ ID NO: 1) and LE-6.1kbrev1 (SEQ ID NO: 2) in a first reaction,
and Lys-6.1 (SEQ ID NO: 3) and LysE1rev (SEQ ID NO: 4) as primers in a second reaction.
PCR cycling steps were: denaturation at 94°C for 1 minute; annealing at 60°C for 1 minute;
extension at 72°C for 6 minutes, for 30 cycles using TAQ PLUS PRECISION DNA
polymerase (STRATAGENE®, La Jolla, CA). The PCR products from these two reactions
were gel purified, and then united in a third PCR reaction using only 5pLMAR2 (SEQ ID
NO: 1) and LysE1rev (SEQ ID NO: 4) as primers and a 10-minute extension period. The
resulting DNA product was phosphorylated, gel-purified, and cloned into the EcoR V
restriction site of the vector pBluescript® KS, resulting in the plasmid p12.0-lys.
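
The cycling conditions can be written out as a simple program to estimate the time spent at the programmed temperatures. This is a sketch only: it ignores ramp times and any initial denaturation or final extension steps, which are not specified above, and it assumes that the third reaction used the same denaturation, annealing and cycle number with only the extension lengthened to 10 minutes. All names are hypothetical.

```python
# Minimal sketch: represent the PCR cycling steps and total hold time.
# Ramp times and initial/final steps are ignored because they are not given.
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    name: str
    temp_c: int
    minutes: float


def cycling_program(extension_minutes: float) -> list:
    """Three-step cycle with the stated denaturation and annealing settings."""
    return [
        Step("denaturation", 94, 1),
        Step("annealing", 60, 1),
        Step("extension", 72, extension_minutes),
    ]


def total_hold_minutes(steps: list, cycles: int = 30) -> float:
    """Total time at temperature over all cycles."""
    return cycles * sum(step.minutes for step in steps)


if __name__ == "__main__":
    first_reactions = cycling_program(extension_minutes=6)
    # Assumption: the third reaction differs only in its 10-minute extension.
    third_reaction = cycling_program(extension_minutes=10)
    print(total_hold_minutes(first_reactions), "min")
    print(total_hold_minutes(third_reaction), "min")
```

Under these assumptions the first two reactions each hold for 240 minutes over 30 cycles, and the third reaction for 360 minutes.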

p12.0-lys was used as a template in a PCR reaction with primers 5pLMAR2 (SEQ
ID NO: 1) and LYSBSU

(5'-CCCCCCCCTAAGGCAG<:CAGGGGCAGGAAGCAAA-3') (SEQ ID NO: 5) and a 10 
minute extension time. The resulting DNA was phosphorylated, gel-purified, and cloned
into the EcoR V restriction site of pBluescript® KS, forming plasmid p12.0lys-B.

p12.0lys-B was restriction digested with Not I and Bsu36 I, gel-purified, and cloned
into Not I and Bsu36 I digested pCMV-LysSPIFNMM, resulting in p12.0-lys-LSPIFNMM.

p12.0-lys-LSPIFNMM was digested with Sal I and the SalItoNotI primer (5'-
TCGAGCGGCCGC-3') (SEQ ID NO: 13) was annealed to the digested plasmid, followed
by Not I digestion. The resulting 12.5 kb Not I fragment, comprising the lysozyme promoter
region linked to the IFNMAGMAX-encoding region and an SV40 polyadenylation signal
sequence, was gel-purified and ligated to Not I cleaved and dephosphorylated
pBluescript® KS, thereby forming the plasmid pAVIJCR-A115.93.1.2, which was then
sequenced.
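
The role of the SalItoNotI primer can be checked directly from its sequence. The sketch below is illustrative only; it assumes the well-known recognition sequence of Not I (GCGGCCGC) and the 5'-TCGA overhang left by Sal I, and the variable names are hypothetical.

```python
# Minimal sketch: confirm that the adaptor oligonucleotide begins with a
# Sal I-compatible overhang and carries a Not I recognition site.

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

SAL_TO_NOT_PRIMER = "TCGAGCGGCCGC"   # sequence given in the text (5'->3')
SALI_OVERHANG = "TCGA"               # 5' overhang left by Sal I (G^TCGAC)
NOTI_SITE = "GCGGCCGC"               # Not I recognition sequence


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


if __name__ == "__main__":
    print("starts with Sal I overhang:",
          SAL_TO_NOT_PRIMER.startswith(SALI_OVERHANG))
    print("Not I site begins at index:", SAL_TO_NOT_PRIMER.find(NOTI_SITE))
    print("Not I site is palindromic:",
          reverse_complement(NOTI_SITE) == NOTI_SITE)
```

Because the Not I portion is self-complementary, two copies of the oligonucleotide can pair across it, leaving TCGA overhangs at both ends that match Sal I-cut DNA. This is consistent with the use of the primer to convert a Sal I site into a Not I site.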

6.16 Example 16: Construction of Plasmids Which Contain the 3' Lysozyme
Domain

The plasmid pAVIJCR-A115.93.1.2 (containing the -12.0 kb lysozyme promoter
controlling expression of human interferon α2b) was purified with a QIAGEN® Plasmid
Maxi Kit (QIAGEN®, Valencia, CA), and 100 µg of the plasmid were restriction digested
with Not I restriction enzyme. The digested DNA was phenol/CHCl3 extracted and ethanol
precipitated. Recovered DNA was resuspended in 1 mM Tris-HCl (pH 8.0) and 0.1 mM
EDTA, then placed overnight at 4°C. DNA was quantified by spectrophotometry and
diluted to the appropriate concentration. The DNA samples were bound to the SV40 T
antigen NLS peptide by incubation for 15 minutes.

The plasmid pAVIJCR-A115.93.1.2 was restriction digested with Fse I and blunt-
ended with T4 DNA polymerase. The linearized, blunt-ended pAVIJCR-A115.93.1.2
plasmid was then digested with Xho I restriction enzyme, followed by treatment with
alkaline phosphatase. The resulting 15.4 kb DNA band containing the lysozyme 5' matrix
attachment region (MAR) and -12.0 kb lysozyme promoter driving expression of a human
interferon was gel purified by electroelution.

The plasmid pIIIilys was restriction digested with Mlu I, then blunt-ended with the
Klenow fragment of DNA polymerase. The linearized, blunt-ended pIIIilys plasmid was
digested with Xho I restriction enzyme and the resulting 6 kb band containing the 3'
lysozyme domain from exon 3 to the 3' end of the 3' MAR was gel purified by
electroelution. The 15.4 kb band from pAVIJCR-A115.93.1.2 and the 6 kb band from
pIIIilys were ligated with T4 DNA ligase and transformed into STBL4 cells (Invitrogen Life
Technologies, Carlsbad, CA) by electroporation. The resulting 21.3 kb plasmids from two
different bacterial colonies were named pAVIJCR-A212.89.2.1 and pAVIJCR-A212.89.2.3,
respectively.



6.17 Example 17: Construction of an ALV-based Vector Having β-lactamase
Encoding Sequences

The lacZ gene of pNLB, a replication-deficient avian leukosis virus (ALV)-based
vector (Cosset et al., J. Virol. 65: 3388-94 (1991)), was replaced with an expression cassette
consisting of a cytomegalovirus (CMV) promoter and the reporter gene β-lactamase (β-La
or BL).

To efficiently replace the lacZ gene of pNLB with a transgene, an intermediate
adaptor plasmid, pNLB-Adapter, was first created. pNLB-Adapter was created by inserting
the chewed-back Apa I/Apa I fragment of pNLB (Cosset et al., 1991, J. Virol. 65:3388-94)
(in pNLB, the 5' Apa I sites reside 289 bp upstream of lacZ and the 3' Apa I sites reside 3' of
the 3' LTR and Gag segments) into the chewed-back Kpn I/Sac I sites of pBluescript® KS(-).
The filled-in Mlu I/Xba I fragment of pCMV-BL (Moore et al., Anal. Biochem. 247: 203-9
(1997)) was inserted into the chewed-back Kpn I/Nde I sites of pNLB-Adapter, replacing
lacZ with the CMV promoter and the BL gene (in pNLB, Kpn I resides 67 bp upstream of
lacZ and Nde I resides 100 bp upstream of the lacZ stop codon), thereby creating
pNLB-Adapter-CMV-BL. To create pNLB-CMV-BL, the Hind III/Blp I insert of pNLB (containing
lacZ) was replaced with the Hind III/Blp I insert of pNLB-Adapter-CMV-BL. This two-step
cloning was necessary because direct ligation of blunt-ended fragments into the Hind III/Blp I
sites of pNLB yielded mostly rearranged subclones, for unknown reasons.

6.18 Example 18: Production of Transduction Particles Having an ALV-based
Vector Having β-lactamase Encoding Sequences

Sentas and Isoldes were cultured in F10 (GIBCO®), 5% newborn calf serum
(GIBCO®), 1% chicken serum (GIBCO®), 50 µg/ml phleomycin (Cayla Laboratories) and
50 µg/ml hygromycin (SIGMA®). Transduction particles were produced as described in
Cosset et al., 1991, herein incorporated by reference, with the following exceptions. Two
days after transfection of the retroviral vector pNLB-CMV-BL (from Example 10, above)
into 9 x 10^5 Sentas, virus was harvested in fresh media for 6-16 hours and filtered. All of
the media was used to transduce 3 x 10^6 Isoldes in three 100 mm plates with polybrene
added to a final concentration of 4 µg/ml. The following day the media was replaced with
media containing 50 µg/ml phleomycin, 50 µg/ml hygromycin and 200 µg/ml G418
(SIGMA®). After 10-12 days, single G418-resistant colonies were isolated and transferred to
24-well plates. After 7-10 days, titers from each colony were determined by transduction of
Sentas followed by G418 selection. Typically 2 out of 60 colonies gave titers at 1-3 x 10^5.
Those colonies were expanded and the virus concentrated to 2-7 x 10^7 as described in
Allioli et al., 1994, Dev. Biol. 165:30-7, herein incorporated by reference. The integrity of
the CMV-BL expression cassette was confirmed by assaying for β-lactamase in the media of
cells transduced with NLB-CMV-BL transduction particles.

6.19 Example 19: pNLB-CMV-IFN Vector Having an IFN Encoding Sequence

The DNA sequence for human interferon α2b based on hen oviduct optimized codon
usage was created using the BACKTRANSLATE program of the Wisconsin Package,
version 9.1 (Genetics Computer Group, Inc., Madison, WI) with a codon usage table
compiled from the chicken (Gallus gallus) ovalbumin, lysozyme, ovomucoid, and
ovotransferrin proteins. The template and primer oligonucleotides (SEQ ID NOS: 14-31)
shown in Figures 8A-B were amplified by PCR with Pfu polymerase (STRATAGENE®, La
Jolla, CA) using 20 cycles of 94°C for 1 min., 50°C for 30 sec., and 72°C for 1 min. and 10
sec.
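
The idea of back-translating a protein with a tissue-specific codon usage table can be illustrated with a minimal sketch. This is not the BACKTRANSLATE program, whose internal algorithm is not described here; it simply picks the most frequently used codon for each amino acid. The codon counts below are hypothetical placeholders, not values compiled from the chicken egg-white genes, and the function names are assumptions made for illustration.

```python
# Minimal back-translation sketch: choose the most frequent codon per residue.
# TOY_USAGE holds hypothetical counts for illustration only; a real table
# would be compiled from the ovalbumin, lysozyme, ovomucoid, and
# ovotransferrin coding sequences.

TOY_USAGE = {
    "M": {"ATG": 10},
    "A": {"GCC": 7, "GCT": 5, "GCA": 2, "GCG": 1},
    "L": {"CTG": 9, "CTC": 4, "TTG": 2},
    "*": {"TAA": 3, "TGA": 1},
}


def most_frequent_codons(usage: dict) -> dict:
    """Map each amino acid to its most frequently observed codon."""
    return {aa: max(codons, key=codons.get) for aa, codons in usage.items()}


def back_translate(peptide: str, usage: dict) -> str:
    """Return a DNA sequence encoding the peptide with preferred codons."""
    preferred = most_frequent_codons(usage)
    return "".join(preferred[aa] for aa in peptide)


dna = back_translate("MAL*", TOY_USAGE)
print(dna)
```

A production tool would typically also avoid unwanted restriction sites and repeats, which this sketch does not attempt.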

PCR products were purified from a 12% polyacrylamide-TBE gel by the "crush and
soak" method (Maniatis et al., 1982), then combined as templates in an amplification
reaction using only IFN-1 (SEQ ID NO: 21) and IFN-8 (SEQ ID NO: 31) as primers. The
resulting PCR product was digested with Hind III and Xba I and gel purified from a 2%
agarose-TAE gel, then ligated into Hind III and Xba I digested, alkaline phosphatase-treated
pBLUESCRIPT® KS (STRATAGENE®), resulting in the plasmid pBluKSP-IFNMagMax.
Both strands were sequenced by cycle sequencing on an ABI PRISM 377 DNA Sequencer
(Perkin-Elmer, Foster City, CA) using universal T7 or T3 primers. Mutations in pBluKSP-IFN
derived from the original oligonucleotide templates were corrected by site-directed
mutagenesis with the Transformer Site-Directed Mutagenesis Kit (Clontech, Palo Alto,
CA). The interferon coding sequence was then removed from the corrected pBluKSP-IFN
with Hind III and Xba I, purified from a 0.8% agarose-TAE gel, and ligated to Hind III and
Xba I digested, alkaline phosphatase-treated pCMV-BetaLa-3B-dH. The resulting plasmid
was pCMV-IFN, which contained the IFN coding sequence controlled by the cytomegalovirus
immediate early promoter/enhancer and SV40 polyA site.

To clone the IFN coding sequence controlled by the CMV promoter/enhancer into
the NLB retroviral plasmid, pCMV-IFN was first digested with Cla I and Xba I, then both
ends were filled in with Klenow fragment of DNA polymerase (New England BioLabs,
Beverly, MA). pNLB-adapter was digested with Nde I and Kpn I, and both ends were made
blunt by T4 DNA polymerase (New England BioLabs). Appropriate DNA fragments were
purified on a 0.8% agarose-TAE gel, then ligated and transformed into DH5α cells. The
resulting plasmid was pNLB-adapter-CMV-IFN.

This plasmid was then digested with Mlu I and partially digested with Blp I and the
appropriate fragment was gel purified. pNLB-CMV-EGFP was digested with Mlu I and Blp
I, then alkaline-phosphatase treated and gel purified. The Mlu I/Blp I partial fragment of
pNLB-adapter-CMV-IFN was ligated to the large fragment derived from the Mlu I/Blp I
digest of pNLB-CMV-EGFP, creating pNLB-CMV-IFN.

6.20 Example 20: Production of pNLB-CMV-IFN Transduction Particles

Senta packaging cells (Cosset et al., 1991) were plated at a density of 3 x 10^5
cells/35 mm tissue culture dish in F-10 medium (Life Technologies) supplemented with
50% calf serum (Atlanta Biologicals), 1% chicken serum (Life Technologies), 50 µg/ml
hygromycin (SIGMA®), and 50 µg/ml phleomycin (CAYLA, Toulouse, France). These
cells were transfected 24 h after plating with 2 µg of CsCl-purified pNLB-CMV-IFN DNA
and 6 µl of Lipofectin liposomes (Life Technologies) in a final volume of 500 µl Optimem
(Life Technologies). The plates were gently rocked for four hours at 37°C in a 5% CO2
incubator. For each well, the media was removed, and the cells were washed once with 1 ml
of Optimem and re-fed with 2 ml of F-10 medium supplemented with 50% calf serum, 1%
chicken serum, 50 µg/ml hygromycin, and 50 µg/ml phleomycin. The next day, medium from
transfected Sentas was recovered and filtered through a 0.45 micron filter.

This medium was then used to transduce Isolde cells. 0.3 ml of the filtered medium
recovered from Senta cells was added to 9.6 ml of F-10 (Life Technologies) supplemented
as described above, in addition to polybrene (SIGMA®) at a final concentration of 4 µg/ml.
This mixture was added to 10^6 Isolde packaging cells (Cosset et al., 1991) plated on a
100 mm dish the previous day, then replaced with fresh F-10 medium (as described for Senta
growth) 4 hours later.

The next day, the medium was replaced with fresh medium which also contained
200 µg/ml neomycin (G418, SIGMA®). Every other day, the medium was replaced with
fresh F-10 medium supplemented with 50% calf serum, 1% chicken serum, 50 µg/ml
hygromycin, 50 µg/ml phleomycin, and 200 µg/ml neomycin. Seven to twelve days later,
single colonies were visible by eye, and these were picked and placed into 24 well dishes.
When some of the 24 well dishes became confluent, medium was harvested and titered to
determine the cell lines with the highest production of retrovirus.

Titering was performed by plating 7.5 x 10^4 Senta cells per well in 24 well plates on
the day prior to viral harvest and transduction. The next day 1 ml of fresh F-10 medium
supplemented with 50% calf serum, 1% chicken serum, 50 µg/ml hygromycin, and 50
µg/ml phleomycin was added to each well of the isolated Isolde colonies. Virus was
harvested for 8-10 hours. The relative density of each well of Isoldes was noted. After 8-10
hours, 2 and 20 µl of media from each well of Isoldes were added directly to the media of
duplicate wells of the Sentas. Harvested medium was also tested for the presence of
interferon by IFN ELISA and for interferon bioreactivity. The next day the media was
replaced with F-10 medium supplemented with 50% calf serum, 1% chicken serum, 50
µg/ml hygromycin, 50 µg/ml phleomycin, and 200 µg/ml neomycin. When obvious
neomycin-resistant colonies were evident in the wells of transduced Sentas, the number of
colonies was counted for each well.

The Isolde colony producing the highest titer was determined by taking into account
the number of colonies and correcting for the density of the Isolde cells when the viral
particles were harvested (i.e., if two Isolde colonies gave rise to media with the same titer,
but one was at a 5% density and the other was at a 50% density at the time of viral harvest,
the one at the 5% density was chosen for further work, as was the case in the present
example).
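
The selection logic described above can be sketched in a few lines. The text does not give a formula for the density correction, so the sketch assumes the simplest one: titer (colonies per ml of harvested medium) divided by the fractional density of the producer well. The colony counts, clone names, and function names are hypothetical and serve only to reproduce the 5% versus 50% comparison.

```python
# Sketch of titer calculation with a density correction.
# Assumption: corrected score = titer / fractional cell density at harvest.
# All numbers below are hypothetical, chosen only to mirror the 5% vs 50% case.


def titer_per_ml(colonies: int, volume_ul: float) -> float:
    """Neomycin-resistant colonies per ml of harvested medium applied."""
    return colonies * 1000.0 / volume_ul


def density_corrected_titer(colonies: int, volume_ul: float, density: float) -> float:
    """Titer normalized to the fractional density of the producer well."""
    return titer_per_ml(colonies, volume_ul) / density


# (colonies counted, volume of medium applied in µl, producer density as a fraction)
wells = {
    "colony_1": (40, 20.0, 0.05),
    "colony_2": (40, 20.0, 0.50),
}

scores = {name: density_corrected_titer(*values) for name, values in wells.items()}
best = max(scores, key=scores.get)
print(best, scores)
```

With equal raw titers, the well harvested at lower density scores higher, matching the choice described in the example.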

The Isolde cell line producing the highest titer of IFN-encoding transducing particles
was scaled up to six T-75 tissue culture flasks. When flasks were confluent, cells were
washed with F-10 medium (unsupplemented) and transducing particles were then harvested
for 16 hours in 14 ml/flask of F-10 containing 1% calf serum (Atlanta Biologicals) and 0.2%
chicken serum (Life Technologies). Medium was harvested, filtered through a 0.45 micron
syringe filter, then centrifuged at 195,000 x g in a Beckman 60Ti rotor for 35 min. Liquid
was removed except for 1 ml, and this was incubated with the pellet at 37°C with gentle
shaking for one hour. Aliquots were frozen at -70°C. Transducing particles were then
titered on Senta cells to determine concentrations used to inject avian sperm.

6.21 Example 21: Construction of Lysozyme Promoter Plasmids

The chicken lysozyme gene expression control region isolated by PCR amplification
is fully disclosed in U.S. Patent Application Serial No. 09/922,549, filed August 3, 2001
and incorporated herein by reference in its entirety. Ligation and reamplification of the
fragments thereby obtained yielded a functionally contiguous nucleic acid construct
comprising the chicken lysozyme gene expression control region operably linked to a
nucleic acid sequence encoding a human interferon α2b polypeptide and optimized for
codon usage in the chicken. Briefly, chicken (Gallus gallus (White Leghorn)) genomic
DNA was PCR amplified using the primers 5pLMAR2 and LE-6.1kbrev1 in a first
reaction, and Lys-6.1 and LysE1rev as primers in a second reaction. PCR cycling steps
were: denaturation at 94°C for 1 minute; annealing at 60°C for 1 minute; extension at 72°C
for 6 minutes, for 30 cycles using TAQ PLUS PRECISION™ DNA polymerase
(STRATAGENE®, La Jolla, CA). The PCR products from these two reactions were gel
purified, and then united in a third PCR reaction using only 5pLMAR2 and LysE1rev as
primers and a 10 minute extension period. The resulting DNA product was phosphorylated,
gel-purified, and cloned into the EcoR V restriction site of the vector pBLUESCRIPT® KS,
resulting in the plasmid p12.0-lys.
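
Uniting two PCR products in a third reaction with only the outer primers is commonly done by letting the products prime each other through a shared end. The sketch below models that joining step in silico. It assumes that the two products overlap at their junction, which the text does not state explicitly, and it uses short toy sequences rather than the actual lysozyme fragments. The function name and the minimum overlap length are hypothetical.

```python
# Sketch of joining two amplicons that share an overlapping end.
# Assumption: the 3' end of the upstream product matches the 5' end of the
# downstream product. The sequences are toy examples, not lysozyme DNA.


def merge_by_overlap(left: str, right: str, min_overlap: int = 4) -> str:
    """Join two sequences using the longest suffix/prefix overlap."""
    for k in range(min(len(left), len(right)), min_overlap - 1, -1):
        if left.endswith(right[:k]):
            return left + right[k:]
    raise ValueError("no overlap of the required length was found")


upstream_product = "AAGCTTGGCATCGGA"    # toy stand-in for the first reaction
downstream_product = "CATCGGATTTACC"    # toy stand-in for the second reaction

united = merge_by_overlap(upstream_product, downstream_product)
print(united)
```

In the laboratory the polymerase performs this extension; the script only shows which full-length product is expected when the overlap is present.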

p12.0-lys was used as a template in a PCR reaction with primers 5pLMAR2 and
LYSBSU and a 10 minute extension time. The resulting DNA was phosphorylated,
gel-purified, and cloned into the EcoR V restriction site of pBLUESCRIPT® KS, forming plasmid
p12.0lys-B.

p12.0lys-B was restriction digested with Not I and Bsu36 I, gel-purified, and cloned
into Not I and Bsu36 I digested pCMV-LysSPIFNMM, resulting in p12.0-lys-LSPIFNMM.
p12.0-lys-LSPIFNMM was digested with Sal I and the SalItoNotI primer was annealed to
the digested plasmid, followed by Not I digestion. The resulting 12.5 kb Not I fragment,
comprising the lysozyme promoter region linked to the IFNMAGMAX-encoding region and an
SV40 polyadenylation signal sequence, was gel-purified and ligated to Not I cleaved and
dephosphorylated pBLUESCRIPT® KS, thereby forming the plasmid pAVIJCR-A115.93.1.2.

6.22 Example 22: Complete Lysozyme Promoter and IFNMAGMAX
Sequences

The complete sequences of the lysozyme gene promoter and the codon-optimized
human interferon α2b nucleic acid are fully disclosed in U.S. Patent Application No.
09/922,549, filed 03 August 2001 and incorporated herein by reference in its entirety. The
complete nucleotide sequence of the approximately 12.5 kb chicken lysozyme promoter
region/IFNMAGMAX construct spans the 5' matrix attachment region (5' MAR), through
the lysozyme signal peptide, to the sequence encoding the gene IFNMAGMAX and the
subsequent polyadenylation signal sequence. The IFNMAGMAX nucleic acid sequence
had been synthesized as described in Example 21 above. The expressed IFN α2b sequence
within plasmid pAVIJCR-A115.93.1.2 functioned as a reporter gene for lysozyme promoter
activity. This plasmid construct may also be used for production of interferon α2b in the
egg white of transgenic chickens.
6.23 Example 23: Synthesis of the MDOT promoter construct

Amplification of the ovomucoid and ovotransferrin promoter sequences 

Oligonucleotide primers 1 (SEQ ID NO: 32) and 2 (SEQ ID NO: 33), as shown in
Figure 9, were used to amplify the ovomucoid sequences. Oligonucleotide primers 3 (SEQ
ID NO: 34) and 4 (SEQ ID NO: 35) were used to amplify the ovotransferrin sequence by
PCR. The primers were designed such that the PCR-amplified ovomucoid sequences
contained an Xho I restriction cleavage site at the 5' end and a Cla I site at the 3' end.
Similarly, the PCR-amplified ovotransferrin product had a Cla I restriction site at the 5' end
and a Hind III site at the 3' end. The overlapping Cla I site was used to splice the two PCR
products to create the MDOT promoter construct. The nucleic acid sequence SEQ ID NO:
11 of the MDOT promoter construct is shown in Figure 11. The final product was cloned in
a bluescript vector between the Xho I and Hind III sites. From the bluescript vector the
promoter region was released by Kpn I/Hind III restriction digestion and cloned into the
prc-CMV-IFN vector to replace the CMV promoter to create MDOT-IFN (clone #10). This
plasmid was tested in vitro.
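
The splicing of two amplicons through a shared Cla I site can be modeled as a simple string operation. The sketch below is a simplification: it treats digestion and ligation as removing the duplicated site so that exactly one Cla I site remains at the junction. The recognition sequences (Xho I CTCGAG, Cla I ATCGAT, Hind III AAGCTT) are standard; the short internal sequences and all names are hypothetical placeholders, not the ovomucoid or ovotransferrin promoter sequences.

```python
# Sketch of splicing two PCR products at a shared Cla I site.
# Assumption: each product carries the full Cla I site at the joining end;
# internal bases are toy placeholders for the promoter fragments.

XHO_I = "CTCGAG"
CLA_I = "ATCGAT"
HIND_III = "AAGCTT"


def splice_at_shared_site(upstream: str, downstream: str, site: str = CLA_I) -> str:
    """Join two fragments so that the shared site appears once at the junction."""
    if not upstream.endswith(site) or not downstream.startswith(site):
        raise ValueError("both fragments must carry the site at the joining end")
    return upstream + downstream[len(site):]


ovomucoid_like = XHO_I + "GGTACA" + CLA_I           # Xho I at 5' end, Cla I at 3' end
ovotransferrin_like = CLA_I + "TTGCCA" + HIND_III   # Cla I at 5' end, Hind III at 3' end

mdot_like = splice_at_shared_site(ovomucoid_like, ovotransferrin_like)
print(mdot_like)
```

The joined product keeps the Xho I and Hind III ends, which is what allows directional cloning of the spliced promoter into a vector.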

6.24 Example 24: Testicular Injection

Five-week-old White Leghorn male chickens were anesthetized using isoflurane.
A small incision was made between the last two ribs to expose the testes. A 5-10 µl virus
suspension of pLNHX-CMV-EGFP/VSVg (9 x 10^6 per ml) was injected into either both
testes or only one of the testes.

At 20 weeks of age, semen samples were collected. Only one bird had sperm in his
semen. Genomic DNA was isolated from the semen and used to amplify the transgene
(CMV-EGFP) by PCR using different DMSO concentrations. The samples were
separated on an agarose gel, transferred onto a nitrocellulose membrane and hybridized with
an EGFP probe. As shown in Figure 11, EGFP positive bands are detected at two different
DMSO concentrations, suggesting that (1) specific PCR conditions are required for the
amplification of the transgene and (2) the sperm samples have incorporated the transgene in
their genome.

7. EQUIVALENTS

Reference now will be made in detail to the various embodiments of the invention,
one or more examples of which are illustrated in the accompanying drawings. Each
example is provided by way of explanation of the invention, not limitation of the invention.
In fact, it will be apparent to those skilled in the art that various modifications,
combinations, additions, deletions and variations can be made in the present invention
without departing from the scope or spirit of the invention. For instance, features illustrated
or described as part of one embodiment can be used in another embodiment to yield a still
further embodiment. It is intended that the present invention covers such modifications,
combinations, additions, deletions and variations as fall within the scope of the appended
claims and their equivalents.
What is Claimed Is:

1. A method of generating a transgenic avian zygote by sperm-mediated transfection,
said method comprising:

(a) obtaining a suspension of avian male germ cells selected from the group
consisting of spermatozoa and spermatozoal precursor cells;

(b) introducing a nucleic acid comprising a transgene comprising a nucleotide
sequence encoding a heterologous polypeptide to the avian male germ cells
by lipofection, electroporation or restriction enzyme mediated integration;

(c) delivering the avian male germ cells having the nucleic acid to an avian
oocyte,

thereby generating a transgenic avian zygote having the nucleic acid incorporated therein.

2. The method of Claim 1, wherein the avian male germ cells and the avian oocyte are
obtained from a chicken.

3. The method of Claim 1, wherein the avian male germ cells and the avian oocytes are
obtained from a quail.

4. The method of Claim 1, wherein the nucleotide sequence encoding said
heterologous polypeptide is operably linked to a transcriptional regulatory element that can
direct gene expression in one or more cells of said transgenic avian.

5. The method of Claim 4, wherein the transcriptional regulatory element is selected
from the group consisting of the promoter regions of the avian genes encoding ovalbumin,
lysozyme, ovomucoid, ovomucin, conalbumin and ovotransferrin.

6. The method of Claim 5, wherein the selected nucleic acid further comprises a
chicken lysozyme gene expression controlling region comprising the nucleotide sequence of
SEQ ID NO: 7.

7. The method of Claim 4, wherein the transcriptional regulatory element is a tissue
specific promoter.

8. The method of Claim 7, wherein the tissue specific promoter is specific for the
magnum.
9. The method of Claim 1, wherein the transgene comprises at least one
cytomegalovirus promoter.

10. The method of Claim 9, wherein the transcriptional regulatory element comprises at
least two regions derived from the promoter of an avian gene, said regions being from a
different promoter.

11. The method of Claim 10, wherein the transcriptional regulatory element has the
nucleotide sequence of SEQ ID NO: 11.

12. The method of Claim 1, wherein the transgene comprises at least one matrix
attachment region (MAR).

13. The method of Claim 12, wherein the transgene comprises a 5' MAR and a 3' MAR
which flank said nucleotide sequence.

14. The method of Claim 1, wherein the heterologous polypeptide is selected from the
group consisting of a cytokine, a hormone, an enzyme, a structural polypeptide, and an
immunoglobulin polypeptide.

15. The method of Claim 14, wherein the cytokine is selected from the group consisting
of interferon, interleukin, granulocyte colony-stimulating factor, granulocyte-macrophage
colony-stimulating factor, stem cell factor, erythropoietin, thrombopoietin, and stem cell
factor.

16. The method of Claim 15, wherein the cytokine is an interferon.

17. The method of Claim 1, wherein the transgene comprises an internal ribosome entry
site (IRES).

18. The method of Claim 17, wherein the transgene comprises at least two nucleotide
sequences each encoding a heterologous polypeptide.

19. The method of Claim 18, wherein the at least two nucleotide sequences encode at
least two heterologous peptides that form a multimeric protein.
20. The method of Claim 19, wherein the multimeric protein specifically binds a
selected ligand.

21. The method of Claim 20, wherein the multimeric protein is an antibody.

22. The method of Claim 1, wherein the heterologous polypeptide comprises a peptide
region suitable for the isolation of the heterologous polypeptide.

23. The method of Claim 1, wherein the nucleic acid is a eukaryotic viral vector.

24. The method of Claim 23, wherein the eukaryotic viral vector is derived from any of
the group consisting of avian leukosis virus, adenovirus, transferrin-polylysine enhanced
adenoviral vectors, human immunodeficiency virus vectors, lentiviral vectors, and Moloney
murine leukemia virus-derived vectors.

25. The method of Claim 1, wherein the nucleic acid is a plasmid vector.

26. The method of Claim 1, wherein the nucleic acid is a bacterial artificial chromosome
(BAC).

27. The method of Claim 1, wherein the nucleic acid is not a eukaryotic viral vector.

28. The method of Claim 4, wherein the transcriptional regulatory element is a
regulatable promoter.

29. The method of Claim 6, wherein the selected nucleic acid further comprises a region
encoding the 3' region of the chicken lysozyme gene and having the nucleotide sequence of
SEQ ID NO: 9.

30. The method of Claim 1, wherein the nucleotide sequence encoding said
heterologous polypeptide comprises an origin of replication.

31. The method of Claim 30, wherein the origin of replication is the SV40 origin of
replication.
32. The method of Claim 1, wherein the nucleic acid is selected from the group
consisting of a linear nucleic acid, a plasmid, a viral nucleic acid, and an artificial
chromosome.

33. The method of Claim 32, wherein the artificial chromosome further comprises a
centromere and optionally a telomere.

34. The method of Claim 32, wherein the linear nucleic acid has at least one cohesive
end characterized by the cohesive end generated by a restriction endonuclease.

35. The method of Claim 32, wherein the linear nucleic acid has at least one blunt end.

36. The method of Claim 34, wherein the at least one cohesive end is generated by
chemical synthesis.

37. The method of Claim 34, wherein the at least one cohesive end is generated by an
enzyme other than a restriction endonuclease.

38. The method of Claim 34, wherein the at least one cohesive end is generated by a
combination of chemical and enzymatic methods.

39. The method of Claim 1, wherein the nucleic acid is introduced to the avian male
germ cells by restriction enzyme mediated integration.

40. The method of Claim 39, further comprising the step of delivering to the avian male
germ cells a restriction endonuclease capable of cleaving the genomic nucleic acid of the
avian male germ cells.

41. The method of Claim 40, wherein the nucleic acid is delivered sequentially with the
restriction endonuclease to the avian male germ cells.

42. The method of Claim 1, wherein the nucleic acid is delivered to the avian male germ
cells by adeno-associated virus-derived vector.
43. The method of Claim 42, wherein the nucleic acid is bounded by inverted terminal
repeat sequences.

44. The method of Claim 42, wherein the nucleic acid is bounded by inverted terminal
repeat sequences derived from an adeno-associated virus-derived vector.

45. The method of Claim 42, wherein the adeno-associated virus-derived vector further
comprises a transcription cassette capable of expressing an adeno-associated virus Rep
protein.

46. The method of Claim 45, wherein the Rep protein is Rep 78.

47. The method of Claim 45, wherein the nucleic acid bounded by inverted terminal
repeat sequences is inserted in a first nucleic acid vector and the transcription cassette
capable of expressing an adeno-associated virus Rep protein is inserted in a second nucleic
acid vector.

48. The method of Claim 1, further comprising the step of irradiating the avian male
germ cells, thereby cleaving the nucleic acid, wherein the radiation is selected from the group
consisting of ultraviolet light, gamma rays, X-rays, and ultrasound.

49. The method of Claim 1, wherein the avian oocyte is an isolated oocyte, and wherein
the avian male germ cells having the nucleic acid are delivered to the isolated oocyte by a
method selected from the group consisting of microinjection, intracytoplasmic sperm
injection (ICSI), and artificial insemination.

50. The method of Claim 49, wherein the avian male germ cells having the nucleic acid
therein are delivered to the nucleus of the oocyte.

51. The method of Claim 1, wherein the nucleic acid forms an episome in the avian
male germ cells.

52. The method of Claim 1, wherein the nucleic acid in the avian oocyte is an episome.
53. The method of Claim 1, further comprising isolating an avian oocyte from the
female of an avian by:

(a) removing an ovum from a bird after ovulation and before fertilization; and

(b) removing an albumen layer from the ovum.

54. The method of Claim 1, further comprising the steps of:

(a) fistulating an avian female;

(b) delivering the transgenic avian zygote to the infundibulum of the avian
female such that said transgenic avian zygote is subsequently laid by said
avian female as a shelled egg; and

(c) incubating the shelled egg until said shelled egg hatches,

thereby producing a transgenic avian containing the transgene.

55. The method of Claim 54, wherein the heterologous polypeptide is expressed in one
or more cells of said transgenic avian.

56. The method of Claim 55, wherein the heterologous polypeptide is expressed in the
serum of said transgenic avian.

57. The method of Claim 55, wherein the heterologous polypeptide is expressed in the
magnum of said transgenic avian.

58. The method of Claim 54, further comprising the step of allowing the transgenic
avian to develop to sexual maturity.

59. The method of Claim 58, wherein the heterologous polypeptide is delivered to the
white of a developing avian egg produced by the transgenic avian.

60. The method of Claim 55 or 59 further comprising isolating said heterologous
polypeptide from said transgenic avian or an egg produced by the transgenic avian.

61. A transgenic avian that produces at least one heterologous polypeptide in egg white,
wherein the transgenic avian or founder ancestor of said transgenic avian was not produced
using a eukaryotic viral vector.
62. A transgenic avian produced by the method of Claim 54.

63. The transgenic avian of Claim 61 or 62, wherein the avian is a chicken.

64. The transgenic avian of Claim 63, wherein the heterologous polypeptide is selected
from the group consisting of a cytokine, a hormone, an enzyme, a structural protein, and an
immunoglobulin polypeptide.

65. The transgenic avian of Claim 63, wherein the cytokine is an interferon.

66. The transgenic avian of Claim 61 or 62, wherein the transgenic avian produces a
heterologous multimeric protein.

67. The transgenic avian of Claim 66, wherein the heterologous multimeric protein
specifically binds a selected ligand.

68. The transgenic avian of Claim 66, wherein the heterologous multimeric protein is an
antibody.

69. An avian egg produced by the transgenic avian of Claim 61 or 62.

70. An avian egg produced by the transgenic avian of any of Claims 63-68.

71. A heterologous protein produced by the transgenic avian of
Claim 61 or 62, wherein the heterologous protein comprises a heterologous polypeptide
selected from the group consisting of a cytokine, a hormone, an enzyme, a structural
protein, and an immunoglobulin polypeptide.

72. The heterologous polypeptide of Claim 71, wherein the cytokine is an interferon.

73. The heterologous protein of Claim 71, wherein the heterologous protein is a
multimeric protein.

74. The heterologous protein of Claim 71, wherein the heterologous protein is an
antibody.
SEQ ID NO: 6

TGCCGCCTTC T7TGATATTC ACTCTGTTGT ATTTCATCTC TTCTTGCCGA TGAAAGGATA 60 
TAACAGTCTG TATAACAGTC TGTGAGGAAA TACTTGGTAT TTCTTCTGAT CAGTGTTTTT 120 
ATAAGTAATG TTGAATATTG GATAAGGCTG TGTGTCCTTT GTCTTGGGAG ACAAAGCCCA 180
CAGCAGGTGG TGGTTGGGGT GGTGGCAGCT CAGTGACAGG AGAGGTTTTT TTGCCTGTTT 240 
TTTTTTTTTT TTTTTTTTTT AAGTAAGGTG TTCTTTTTTC TTAGTAAVTT TTCTACTGGA 300 
CTGTATGTTT TGACAGGTCA GAAACATTTC TTCAAAAGAA GAACCTTTTG GAAACTGTAC 360 
AGCCCTTTTC TTTCATTCCC TTTTTGCTTT CTGTGCCAAT GCCTTTGGTT CTGATTGCAT 420 
TATGGAAAAC GTTGATCGGA ACTTGAGGTT TTTATTTATA GTGTGGCTTG AAAGCTTGGA 480 
TAGCTGTTGT TACACGAGAT ACCTTATTAA GTTTAGGCCA GCTTGATGCT TTATTTTTTC 540 
CCTTTGAAGT AGTGAGCGTT CTCTGGTTTT TTTCCTTTGA AACTGGTGAG GCTTAGATTT 600 
TTCTAATGGG AfTTTTTACC TGATGATCTA GTTGCATACC CAAATGCTTG TAAATGTTTT 660 
CCTAGTTAAC /^GTTGATAA CTTCGGATTT ACATGTTGTA TATACTTGTC ATCTGTGTTT 720 
CTAGTAAAAA TATATGGCAT TTATAGAAAT ACGTAATTCC TGATTTCCTT TTTTTTTATC 780 
TCTATGCTCT GTG^GTACAG GTCAAACAGA CTTCACTCCT ATTTTTATTT ATAGAATTTT 840 
ATATGCAGTC TGTGTTGGT TCTTGTGTTG TAAGGATACA GCCTTAAATT TCCTAGAGCG 900 
ATGCTCAGTA AGGCGGGTTG TCACATGGGT TCAAATGTAA AACGGGCACG TTTGGCTGCT 960 
GCCTTCCCGA G^CCAGGAC ACTAAACTGC TTCTGCACTG AGGTATAAAT CGCTTCAGAT 1020 
CCCAGGGAAG TGCAGATCCA CGTGCATATT CTTAAAGAAG AATGAATACT TTCTAAAATA 1080 
TTTTGGCATA GGAAGCAAGC TGCATGGATT TGTTTGGGAC TTAAATTATT TTGGTAACGG 1140 
AGTGCATAGG ttttAAACAC AGTTGCAGCA TGCTAACGAG TCACAGCGTT TATGCAGAAG 1200 
TGATGCCTGG ATGCCTGTTG CAGCTGTTTA CGC-CACTGGC TTGCAGTGAG CATTGCAGAT 1260 
AGGGGTGGGG TG'TTTGTGT CGTGTTCCCA CACGCTGCCA CACAGCCACC TCCCGGAACA 1320 
CATCTCACCT GCTGGGTACT TTTCAAACCA TCTTAGCAGT AGTAGATGAG TTACTATGAA 1380 
ACAGAGAAGT TCCTCAGTTG GATATTCTCA TGGGATGTCT TTTTTCCCAT GTTGGGCAAA 1440 
GTATGATAAA GCATCTCTAT TTGTAAATTA TGCACTTGTT AGTTCCTGAA TCCTTTCTAT 1500
AGCPCCACTT A7TGCAGCAG GTGTAGGCTC TGGTGTGGCC TGTGTCTGTG CTTCAATCTT 1560
TTAAAGCTTC TTTGGAAATA CACTGACTTG ATTGAAGTCT CTTGAAGATA GTAAACAGTA 1620
CTTACCTTTG ATCCCAATGA AATCGAGCAT TTCAGTTGTA AAAGAATTCC GCCTATTCAT 1680 
ACCATGTAAT GTAATTTTAC ACCCCCAGTG CTGACACTTT GGAATATATT CAAGTAATAG 1740 
ACTTTGGCCT OCCCTCTTG TGTACTGTAT TTTGTAATAG AAAATATTTT AAACTGTGCA 1800 
TATGATTATT ACATTATGAA AGAGACATTC TGCTGATCTT CAAATGTAAG AAAATGAGGA 1860 
GTGCGTGTGC TTTTATAAAT ACAAGTGATT GCAAATTAGT GCAGGTGTCC TTAAAAAAAA 1920 
AAAAAAAAAG TAATATAAAA AGGACCAGGT GTTTTACAAG TGAAATACAT TCCTATTTGG 1980 
TAAACAGTTA CATTTTTATG AAGATTACCA GCGCTGCTGA CTTTCTAAAC ATAAGGCTGT 2040 
ATTGTCTTCC TGTACCATTG CATTTCCTCA TTCCCAATTT GCACAAGGAT GTCTGGGTAA 2100
ACTATTCAAG AAATGGCTTT GAAATACAGC ATGGGAGCTT GTCTGAGTTG GAATGCAGAG 2160
TTGCACTGCA AAATGTCAGG AAATGGATGT CTCTCAGAAT GCCCAACTCC AAAGGATTTT 2220 
ATATGTGTAT ATAGTAAGCA GTTTCCTGAT TCCAGCAGGC CAAAGAGTCT GCTGAATGTT 2280 
GTGTTGCCGG AGACCTGTAT TTCTCAACAA GGTAAGATGG TATCCTAGCA ACTGCGGATT 2340 
TTAATACATT TTCAGCAGAA GTACTTAGTT AATCTCTACC TTTAGGGATC GTTTCATCAT 2400 
TTTTAGATGT TATACTTGAA ATACTGCATA ACTTTTAGCT TTCATGGGTT CCTTTTTTTC 2460
AGCCtSaGG AGACTGTTAA GCAATTTGCT GTCCAACTTT TGTGTTGGTC TTAAACTGCA 2520
ATAGTAGTTT ACCTTGTATT GAAGAAATAA AGACCATTTT TATATTAAAA AATACTTTTG 2580
££££££ iSSSS TCTGATATCC TTGCAGTGCC CATTATGTCA GTTCTGTCAG 2640
ATATTCAGAC ATCAAAACTT AACGTGAGCT CAGTGGAGTT ACAGCTGCGG TTTTGATGCT 2700 
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GTTA.TTATTT CTGAAACTAG AAATGATGTT GTCTTCATCT GCTCATCAAA CACTTCATGC 2760 
AGAGTGTAAG GCTAGTGAGA AATGCATACA TTTATTGATA CTTTTTTAAA GTCAACTTTT 2820
TATCAGATTT TTTTTTCATT TGGAAATATA TTGTTTTCTA GACTGCATAG CTTCTGAATC 2880
TGAAATGCAG TCTGATTGGC ATGAAGAAGC ACAGCACTCT TCATCTTACT TAAACTTCAT 2940
TTTGGAATGA AGGAAGTTAA GCAAGGGCAC AGGTCCATGA AATAGAGACA GTGCGCTCAG 3000
GAGAAAGTGA ACCTGGATTT CTTTGGCTAG TGTTCTAAAT CTGTAGTGAG GAAAGTAACA 3060 
CCCGATTCCT TGAAAGGGCT CCAGCTTTAA TGCTTCCAAA TTGAAGGTGG CAGGCAACTT 3120 
GGCCACTGGT TATTTACTGC ATTATGTCTC AGTTTCGCAG CTAACCTGGC TTCTCCACTA 3180 
TTGAGCATGG ACTATAGCCT GGCTTCAGAG GCCAGGTGAA GGTTGGGATG GGTGGAAGGA 3240
GTGCTGGGCT GTGGCTGGGG GGACTGTGGG GACTCCAAGC TGAGCTTGGG GTGGGCAGCA 3300
CAGGGAAAAG TGTGGGTAAC TATTTTTAAG TACTGTGTTG CAAACGTCTC ATCTGCAAAT 3360 
ACGTAGGGTG TGTACTCTCG AAGATTAACA GTGTGGGTTC AGTAATATAT GGATGAATTC 3420
ACAGTGGAAG CATTCAAGGG TAGATCATCT AACGACACCA GATCATCAAG CTATGATTGG 3480
AAGCGGTATC AGAAGAGCGA GGAAGGTAAG CAGTCTTCAT ATGTTTTCCC TCCACGTAAA 3540
GCAGTCTGGG AAAGTAGCAC CCCTTGAGCA GAGACAAGGA AATAATTCAG GAGCATGTGC 3600
TAGGAGAACT TTCTTGCTGA ATTCTACTTG CAAGAGCTTT GATGCCTGGC TTCTGGTGCC 3660 
TTCTGCAGCA CCTGCAAGGC CCAGAGCCTG TGGTGAGCTG GAGGGAAAGA TTCTGCTCAA 3720
GTCCAAGCTT CAGCAGGTCA TTGTCTTTGC TTCTTCCCCC AGCACTGTGC AGCAGAGTGG 3780
AACTGATGTC GAAGCCTCCT GTCCACTACC TGTTGCTGCA GGCAGACTGC TCTCAGAAAA 3840
AGAGAGCTAA CTCTATGCCA TAGTCTGAAG GTAAAATGGG TTTTAAAAAA GAAAACACAA 3900 
AGGCAAAACC GGCTGCCCCA TGAGAAGAAA GCAGTGGTAA ACATGGTAGA AAAGGTGCAG 3960
AAGCCCCCAG GCAGTGTGAC AGGCCCCTCC TGCCACCTAG AGGCGGGAAC AAGCTTCCCT 4020 
GCCTAGGGCT CTGCCCGCGA AGTGCGTGTT TCTTTGGTGG GTTTTGTTTG GCGTTTGGTT 4080
TTGAGATTTA GACACAAGGG AAGCCTGAAA GGAGGTGTTG GGCACTATTT TGGTTTGTAA 4140 
AGCCTGTACT TCAAATATAT ATTTTGTGAG GGAGTGTAGC GAATTGGCCA ATTTAAAATA 4200
AAGTTGCAAG AGATTGAAGG CTGAGTAGTT GAGAGGGTAA CACGTTTAAT GAGATCTTCT 4260
GAAACTACTG CTTCTAAACA CTTGTTTGAG TGGTGAGACC TTGGATAGGT GAGTGCTCTT 4320
GTTACATGTC TGATGCACTT GCTTGTCCTT TTCCATCCAC ATCCATGCAT TCCACATCCA 4380
CGCATTTGTC ACTTATCCCA TATCTGTCAT ATCTGACATA CCTGTCTCTT CGTCACTTGG 4440
TCAGAAGAAA CAGATGTGAT AATCCCCAGC CGCCCCAAGT TTGAGAAGAT GGCAGTTGCT 4500

TCTTTCCCTT TTTCCTGCTA AGTAAGGATT TTCTCCTGGC TTTGACACCT CACGAAATAG 4560
TCTTCCTGCC TTACATTCTG GGCATTATTT CAAATATCTT TGGAGTGCGC TGCTCTCAAG 4620
TTTGTGTCTT CCTACTCTTA GAGTGAATGC TCTTAGAGTG AAAGAGAAGG AAGAGAAGAT 4680
GTTGGCCGCA GTTCTCTGAT GAACACACCT CTGAATAATG GCCAAAGGTG GGTGGGTTTC 4740
TCTGAGGAAC GGGCAGCGTT TGCCTCTGAA AGCAAGGAGC TCTGCGGAGT TGCAGTTATT 4800

TTGCAACTGA TGGTGGAACT GGTGCTTAAA GCAGATTCCC TAGGTTCCCT GCTACTTCTT 4860
TTCCTTCTTG GCAGTCAGTT TATTTCTGAC AGAGAAACAG CCACCCCCAC TGCAGGCTTA 4920
GAAAGTATGT GGCTCTGCCT GGGTGTGTTA CAGCTCTGCC CTGGTGAAAG GGGATTAAAA 4980
CGGGCACCAT TCATCCCAAA CAGGATCCTC ATTCATGGAT CAAGCTGTAA GGAACTTGGG 5040
CTCCAACCTC AAAACATTAA TTGGAGTACG AATGTAATTA AAACTGCATT CTCGCATTCC 5100 

TAAGTCATTT AGTCTGGACT CTGCAGCATG TAGGTCGGCA GCTCCCACTT TCTCAAAGAC 5160
CACTGATGGA GGAGTAGTAA AAATGGAGAC CGATTCAGAA CAACCAACGG AGTGTTGCCG 5220
AAGAAACTGA TGGAAATAAT GCATGAATTG TGTGGTGGAC ATTTTTTTTA AATACATAAA 5280
CTACTTCAAA TGAGGTCGGA GAAGGTCAGT GTTTTATTAG CAGCCATAAA ACCAGGTGAG 5340

CGAGTACCAT TTTTCTCTAC AAGAAAAACG ATTCTGAGCT CTGCGTAAGT ATAAGTTCTC 5400 
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CATAGCGGCT GAAGCTCCCC CCTGGCTGCC 
CCTTGGGGTT TCTCTCACAG CAGTAATGGG 
TGTCATGTGG GATCCCTACT GTGCCCTCCT 
CAGCGGTTTG GAAAGAGAAA AAGAATTTGG 
CCAGCATTTT GGTTTTTAAT TATGTCAATA 
TGGGTGTATT ACCGAGGAAC AAAGGAAGGC 
ACTGGCAAGC TGTCAAAAAC AAAAAGGCCT 
GCCAGCAGGG CCAGCACGAG GGATGGTGCA 
ACTCTGAGAG CAACTGCTTT GGAAATGACA 
TGCGTAGAGC GTGTGCTTGG CGACAGTTTT 
TCCTCATTCT CCTAAGCATG TCTCCATGCT 
ATGAATCCAT CACTGTAGGA TTCTCGTGGT 
ATGGAAGCTT ATTTATTTTT CGTTCTTCCA 
ACCACAGCAA ATTAAAGGTG AAGGAGGCTG 
TTCTTCCTTG CAAGGCCACA GGAAAATGCT 
AGTTCAGTCT CCTGCTGGGA CAGCTAACCG
AGGACCAAAT AGGGTCTATC TGGGGTTTTT 
CACTATTTCA CTGCTCCCAC GGTTACAAAC 
ACATTACATA AATTTGACCT GGTACCAATA 
CTGTGTTTAA CCCCTTAAGG CATTCAGAAC 
AGGGGCCTTA AACATCATCC ATTTCCAACC 
CTCAGGCTGC CCAGGGCCCC ATCCAGCCTG 
ACAGCTTCTC TGGGCAGCCT GTGCCAACAC
TTAACATCTA ATCTAAATCT CTTCTCTTTT 
CTATCTGTCC AAGAAATGTG TATTGGTCTC 
GGCTGCAGTG AGGTCTCCCC ACAGCCTTCT 
CAGCCTGTCT TCGTAGGAGA TCATCTTAGT 
CACGGCTTTC TTGTGGAGCC CCAGGTCTGG 
GCAGAGCAGA TGGGGACAAT CGCTTACCCC 
CCCAGGGTAC TGTTGGCCTT TCAGGCTCCC 
CATCCACCAG AACCCACGCT TCCTGGTTAA 
TCAGGAGACT TCCATTCTTT AGGACAGACT 
ATATACATTT CAGTTCATGT TTCCTGTAAC 
TACATGCAGA ATTCCTAGTG CCATCTCAGT 
CAATTTGCTG CAAGTACCTT CCAAGCTGCG 
TTACCTTTTG GGGTAAGCTT TTGTATCTGC
CTCTGCTCTG TTCTGACTGC ACCATTTTCT 
TTGTCCTCCA TCCTTTCCCA GCTTGTATCT 
CTTCAGCAGC CATTTAATTC TTCAGTGTCA 
TTTTCAGCAG TCTTGCAAAG AACATCTAGC 
CAGTTCTTCT TGTTTGAGGT GAGCCATAAA 
GCATTTTATT ACTTCTATTA TGTACTTACT 
CTGGGATTTC CACAGTGTCT CTGTGTCCTT 
AACCTTGGCA ATCTGCCCAG CTGCCCATCA 



TGCCATCTCA GCTGGAGTGC AGTGCCATTT 5460 
ACAATACTTC ACAAAAATTC TTTCTTTTCC 5520 
GGTTTTACGT TACCCCCTGA CTGTTCCATT 5580 
AAATAAAACA TGTCTACGTT ATCACCTCCT 5640 
ACTGGCTTAG ATTTGGAAAT GAGAGGGGGT 5700
TTATATAAAC TCAAGTCTTT TATTTAGAGA 5760
TACCACCAAA TTAAGTGAAT AGCCGCTATA 5820 
CTGCTGGCAC TATGCCACGG CCTGCTTGTG 5880 
GCACTTGGTG CAATTTCCTT TGTTTCAGAA 5940 
TCTAGTTAGG CCACTTCTTT TTTCCTTCTC 6000 
GGTAATCCCA GTCAAGTGAA CGTTCAAACA 6060 
GATCAAATCT TTGTGTGAGG TCTATAAAAT 6120 
TATCAGTCTT CTCTATGACA ATTCACATCC 6180 
GTGGGATGAA GAGGGTCTTC TAGCTTTACG 6240 
GAGAGCTGTA GAATACAGCC TGGGGTAAGA 6300 
CATCTTATAA CCCCTTCTGA GACTCATCTT 6360 
GTTCCTGCTG TTCCTCCTGG AAGGCTATCT 6420
CAAAGATACA GCCTGAATTT TTTCTAGGCC 6480 
TTGTTCTCTA TATAGTTATT TCCTTCCCCA 6540 
AACTAGAATC ATAGAATGGT TTGGATTGGA 6600 
CTCTGCCATG GGCTGCTTGC CACCCACTGG 6660 
GCCTTGAGCA CCTCCAGGGA TGGGGCACCC 6720 
CTCACCACTC TCTGGGTAAA GAATTCTCTT 6780
AGTTTAAAGC CATTCCTCTT TTTCCCGTTG 6840 
CCTCCTGCTT ATAAGCAGGA AGTACTGGAA 6900 
CTTCTCCAGG CTGAACAAGC CCAGCTCCTT 6960
GGCCCTCCTC TGGACCCATT CCAACAGTTC 7020
ATGCAGTACT TCAGATGGGG CCTTACAAAG 7080 
TCCCTGCTGG CTGCCCCTGT TTTGATGCAG 7140 
AGACCCCTTG CTGATTTGTG TCAAGCTTTT 7200 
TACTTCTGCC CTCACTTCTG TAAGCTTGTT 7260
GTGTTACACC TACCTGCCCT ATTCTTGCAT 7320
AGGACAGAAT ATGTATTCCT CTAACAAAAA 7380 
AGGGTTTTCA TGGCAGTATT AGCACATAGT 7440
GCCTCCCATA AATCCTGTAT TTGGGATCAG 7500
AGAGACCCTG GGGGTTCTGA TGTGCTTCAG 7560 
AGATCACCCA GTTGTTCCTG TACAACTTCC 7620 
TTGACAAATA CAGGCCTATT TTTGTGTTTG 7680 
TCTTGTTCTG TTGATGCCAC TGGAACAGGA 7740
TGAAAACTTT CTGCCATTCA ATATTCTTAC 7800 
TTACTAGAAC TTCGTCACTG ACAAGTTTAT 7860 
TTGACATAAC ACAGACACGC ACATATTTTG 7920 
CACATGGTTT TACTGTCATA CTTCCGTTAT 7980 
CAAGAAAAGA GATTCCTTTT TTATTACTTC 8040 
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ATAAACAA AA TGTGAGAAGC CCAAACAAGA ACTTGTGGGG CAGGCTGCCA 8100 
TCAAGGGAGA GACAGCTGAA GGGTTGTGTA GCTCAATAGA ATTAAGAAAT AATAAAGCTG 8160
TGTCAGACAG TTTTGCCTGA TTTATACAGG CACGCCCCAA GCCAGAGAGG CTGTCTGCCA 8220
AGGCCACCTT GCAGTCCTTG GTTTGTAAGA TAAGTCATAG GTAACTTTTC TGGTGAATTG 8280
CGTGGAGAAT CATGATGGCA GTTCTTGCTG TTTACTATGG TAAGATGCTA AAATAGGAGA 8340
CAGCAAAGTA ACACTTGCTG CTGTAGGTGC TCTGCTATCC AGACAGCGAT GGCACTCGCA 8400 
CACCAAGATG AGGGATGCTC CCAGCTGACG GATGCTGGGG GAGTAACAGT GGGTCCCATG 8460 
CTGCCTGCTC ATTAGCATCA CCTCAGCCCT CACCAGCCCA TCAGAAGGAT CATCCCAAGC 8520
TGAGGAAAGT TGCTCATCTT CTTCACATCA TCAAACCTTT GGCCTGACTG ATGCCTCCCG 8580
GATGCTTAAA TGTGGTCACT GACATCTTTA TTTTTCTATG ATTTCAAGTC AGAACCTCCG 8640
GATCAGGAGG GAACACATAG TGGGAATGTA CCCTCAGCTC CAAGGCCAGA TCTTCCTTCA 8700 
ATGATCATGC ATGCTACTTA GGAAGGTGTG TGTGTGTGAA TGTAGAATTG CCTTTGTTAT 8760
TTTTTCTTCC TGCTGTCAGG AACATTTTGA ATACCAGAGA AAAAGAAAAG TGCTCTTCTT 8820 
GGCATGGGAG GAGTTGTCAC ACTTGCAAAA TAAAGGATGC AGTCCCAAAT GTTCATAATC 8880 
TCAGGGTCTG AAGGAGGATC AGAAACTGTG TATACAATTT CAGGCTTCTC TGAATGCAGC 8940 
TTTTGAAAGC TGTTCCTGGC CGAGGCAGTA CTAGTCAGAA CCCTCGGAAA CAGGAACAAA 9000 
TGTCTTCAAG GTGCAGCAGG AGGAAACACC TTGCCCATCA TGAAAGTGAA TAACCACTGC 9060 
CGCTGAAGGA ATCCAGCTCC TGTTTGAGCA GGTGCTGCAC ACTCCCACAC TGAAACAACA 9120
GTTCATtSt ATAGGACTTC CAGGAAGGAT CTTCTTCTTA AGCTTCTTAA TTATGGTACA 9180
TCTCCAGTTG GCAGATGACT ATGACTACTG ACAGGAGAAT GAGGAACTAG CTGGGAATAT 9240 
TTCTGTTTGA CCACCATGGA GTCACCCATT TCTTTACTGG TATTTGGAAA TAATAATTCT 9300
GAATTGCAAA GCAGGAGTTA GCGAAGATCT TCATTTCTTC CATGTTGGTG ACAGCACAGT 9360
TCTGGCTATG AAAGTCTGCT TACAAGGAAG AGGATAAAAA TCATAGGGAT AATAAATCTA 9420 
AGTTTGAAGA CAATGAGGTT TTAGCTGCAT TTGACATGAA GAAATTGAGA CCTCTACTGG 9480 
ATAGCTATGG TATTTACGTG TCTTTTTGCT TAGTTACTTA TTGACCCCAG CTGAGGTCAA 9540 
GTATGAACTC AGGTCTCTCG GGCTACTGGC ATGGATTGAT TACATACAAC TGTAATTTTA 9600
GCAGTGATTT AGGGTTTATG AGTACTTTTG CAGTAAATCA TAGGGTTAGT AATGTTAATC 9660
TCAGGGAAAA AAAAAAAAAG CCAACCCTGA CAGACATCCC AGCTCAGGTG GAAATCAAGG 9720 
ATCACAGCTC AGTGCGGTCC CAGAGAACAC AGGGACTCTT CTCTTAGGAC CTTTATGTAC 9780
AGGGCCTCAA GATAACTGAT GTTAGTCAGA AGACTTTCCA TTCTGGCCAC AGTTCAGCTG 9840
AGGCAATCCT GGAATTTTCT CTCCGCTGCA CAGTTCCAGT CATCCCAGTT TGTACAGTTC 9900
TGGCACTTTT TGGGTCAGGC CGTGATCCAA GGAGCAGAAG TTCCAGCTAT GGTCAGGGAG 9960
TGCCTGACCG TCCCAACTCA CTGCACTCAA ACAAAGGCGA AACCACAAGA GTGGCTTTTG 10020
TTGAAATTGC AGTGTGGCCG AGAGGGGCTG CACCAGTACT GGATTGACCA CGAGGCAACA 10080
TTAATCCTCA GCAAGTGCAA TTTGCAGCCA TTAAATTGAA CTAACTGATA CTACAATGCA 10140
ATCAGTATCA ACAAGTGGTT TGGCTTGGAA GATGGAGTCT AGGGGCTCTA CAGGAGTAGC 10200 
TACTCTCTAA TGGAGTTGCA TTTTGAAGCA GGACACTGTG AAAAGCTGGC CTCCTAAAGA 10260
GGCTGCTAAA CATTAGGGTC AATTTTCCAG TGCACTTTCT GAAGTGTCTG CAGTTCCCCA 10320 
TGCAAAGCTG CCCAAACATA GCACTTCCAA TTGAATACAA TTATATGCAG GCGTACTGCT 10380
TCTTGCCAGC ACTGTCCTTC TCAAATGAAC TCAACAAACA ATTTCAAAGT CTAGTAGAAA 10440
GTAACAAGCT TTGAATGTCA TTAAAAAGTA TATCTGCTTT CAGTAGTTCA GCTTATTTAT 10500 
GCCCACTAGA AACATCTTGT ACAAGCTGAA CACTGGGGCT CCAGATTAGT GGTAAAACCT 10560 
ACTTTATACA ATCATAGAAT CATAGAATGG CCTGGGTTGG AAGGGACCCC AAGGATCATG 10620
AAGATCCAAC AGCCCCGCCA CAGGCAGGGC CACCAACCTC CAGATCTGGT ACTAGACCAG 10680
GCAGCCCAGG GCTCCATCCA ACCTGGCCAT GAACACCTCC AGGGATGGAG CATCCACAAC 10740
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CTCTCTGGGC AGCCTGTGCC AGCACCTCAC 
ATCCAATCTA AGCCTTCCCT CCTTGAGGTT 
TACTCTTGTA AAAAGTTGAT TCTCCTCCTT 
GCGTTCTTCT CTTCTGCAGG ATGAACAAGC 
GGTGCTCCAG CCCTCTGATC ATCTTTGTGG 
CATCTTTCCT GTACTGGGGG CCCCAGGCCT 
GAGCAGAGTA AAGAGGGACA ATCACCTTCC 
AGCCCTGGAT ACAACTGGCT TTCTGAGCTG 
ACAGGAACAA TACAACAGGT GCTGATGGCC 
GGTAGATCTT AGATGAGGAA CGTTGAAGTT 
ATACTCCTGC CTGATACCTC ACCCCACCTG 
CAGGGCCCTG ATGAACCCGG CACTGCTTCA 
TTGCACCTAT GAATACACAA ACAATGTGTT 
AATTTGCATT GTCAGGAAAT GGTTTAGTAA 
TGGCTGTTTT TATGGCTGTT AGTAGTGGTA 
AATCAAGACT GTAGATATTG CAACAGACTA 
TACTTCCCAC ATTGTATAAG AAATTTGGCA
ATTTCTGTAT ACTCAAGAGG GCGTTTTTGA 
TGGGAGGAAG TTAAAAGAAG AGGCAGGTGC 
ACACTGGCAA CATGAGGTCT TTGCTAATCT
TAGGGTGCGA TCTGCCTCAG ACCCACAGCC 
CTCAGATGAG GAGAATCAGC CTGTTTAGCT 
CTCAAGAGGA GTTTGGCAAC CAGTTTCAGA 
TGATCCAGCA GATCTTTAAC CTGTTTAGCA 
CCCTGCTGGA TAAGTTTTAC ACCGAGCTGT 
TGATCCAGGG CGTGGGCGTG ACCGAGACCC 
TGAGGAAGTA CTTTCAGAGG ATCACCCTGT 
CTTGGGAAGT CGTGAGGGCT GAGATCATGA 
AGAGCTTGAG GTCTAAGGAG TAAAAAGTCT 
ACATGATAAG ATACATTGAT GAGTTTGGAC 
GCTTTATTTG TGAAATTTGT GATGCTATTG
AACAAGTTAA CAACAACAAT TGCATTCATT 
AGGTTTTTTA AAGCAAGTAA AACCTCTACA 
GCGGCCGC 12728 



CACCCTCTCT GTGAAGAACT TTTCCCTGAC 10800 
AGATCCACTC CCCCTTGTGC TATCACTGTC 10860 
TTTGGAAGGT TGCAATGAGG TCTCCTTGCA 10920
CCAGCTCCCT CAGCCTGTCT TTATAGGAGA 10980 
CCCTCCTCTG GACCCGCTCC AAGAGCTCCA 11040 
GAATGCAGTA CTCCAGATGG GGCCTCAAAA 11100 
TCACCCTGCT GGCCAGCCCT CTTCTGATGG 11160 
CAACTTCTCC TTATCAGTTC CACTATTAAA 11220
AGTGCAGAGT TTTTCACACT TCTTCATTTC 11280 
GTGCTTCTGC GTGTGCTTCT TCCTCCTCAA 11340 
CCACTGAATG GCTCCATGGC CCCCTGCAGC 11400 
GATGCTGTTT AATAGCACAG TATGACCAAG 11460 
GCATCCTTCA GCACTTGAGA AGAAGAGCCA 11520 
TTCTGCCAAT TAAAACTTGT TTATCTACCA 11580 
CACTGATGAT GAACAATGGC TATGCAGTAA 11640 
TAAAATTCCT CTGTGGCTTA GCCAATGTGG 11700 
AGTTTAGAGC AATGTTTGAA GTGTTGGGAA 11760 
CAACTGTAGA ACAGAGGAAT CAAAAGGGGG 11820 
AAGAGAGCTT GCAGTCCCGC TGTGTGTACG 11880
TGGTGCTTTG CTTCCTGCCC CTGGCTGCCT 11940 
TGGGCAGCAG GAGGACCCTG ATGCTGCTGG 12000 
GCCTGAAGGA TAGGCACGAT TTTGGCTTTC 12060 
AGGCTGAGAC CATCCCTGTG CTGCACGAGA 12120 
CCAAGGATAG CAGCGCTGCT TGGGATGAGA 12180 
ACCAGCAGCT GAACGATCTG GAGGCTTGCG 12240 
CTCTGATGAA GGAGGATAGC ATCCTGGCTG 12300 
ACCTGAAGGA GAAGAAGTAC AGCCCCTGCG 12360 
GGAGCTTTAG CCTGAGCACC AACCTGCAAG 12420 
AGAGTCGGGG CGGCCGGCCG CTTCGAGCAG 12480 
AAACCACAAC TAGAATGCAG TGAAAAAAAT 12540 
CTTTATTTGT AACCATTATA AGCTGCAATA 12600 
TTATGTTTCA GGTTCAGGGG GAGGTGTGGG 12660 
AATGTGGTAA AATCGATAAG GATCCGTCGA 12720 
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SEQIDNO:5 

TGCGATCTGC CTCAGACCCA CAGCCTGGGC AGCAGGAGGA 
ATGAGGAGAA TCAGCCTGTT TAGCTGCCTG AAGGATAGGC 
GAGGAGTTTG GCAACCAGTT TCAGAAGGCT GAGACCATCC 
CAGCAGATCT TTAACCTGTT TAGCACCAAG GATAGCAGCG 
CTGGATAAGT TTTACACCGA GCTGTACCAG CAGCTGAACG 
CAGGGCGTGG GCGTGACCGA GACCCCTCTG ATGAAGGAGG
AAGTACTTTC AGAGGATCAC CCTGTACCTG AAGGAGAAGA 
GAAGTCGTGA GGGCTGAGAT CATGAGGAGC TTTAGCCTGA 
TTGAGGTCTA AGGAGTAA 498 



CCCTGATGCT GCTGGCTCAG 60 

ACGATTTTGG CTTTCCTCAA 120 

CTGTGCTGCA CGAGATGATC 180 

CTGCTTGGGA TGAGACCCTG 240 

ATCTGGAGGC TTGCGTGATG 300 

ATAGCATCCT GGCTGTGAGG 360 

AGTACAGCCC CTGCGCTTGG 420 

GCACCAACCT GCAAGAGAGC 480 
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TGCCGCCTTC TTTGATATTC ACTCTGTTGT ATTTCATCTC TTCTTGCCGA TGAAAGGATA 60 
TAACAGTCTG TATAACAGTC TGTGAGGAAA TACTTGGTAT TTCTTCTGAT CAGTGTTTTT 120 
ATAAGTAATG TTGAATATTG GATAAGGCTG TGTGTCCTTT GTCTTGGGAG ACAAAGCCCA 180
CAGCAGGTGG TGGTTGGGGT GGTGGCAGCT CAGTGACAGG AGAGGTTTTT TTGCCTGTTT 240
TTTTTTTTTT TTTTTTTTTT AAGTAAGGTG TTCTTTTTTC TTAGTAAATT TTCTACTGGA 300 
CTGTATGTTT TGACAGGTCA GAAACATTTC TTCAAAAGAA GAACCTTTTG GAAACTGTAC 360
AGCCCTTTTC TTTCATTCCC TTTTTGCTTT CTGTGCCAAT GCCTTTGGTT CTGATTGCAT 420 
TATGGAAAAC GTTGATCGGA ACTTGAGGTT TTTATTTATA GTGTGGCTTG AAAGCTTGGA 480 
TAGCTGTTGT TACACGAGAT ACCTTATTAA GTTTAGGCCA GCTTGATGCT TTATTTTTTC 540 
CCTTTGAAGT AGTGAGCGTT CTCTGGTTTT TTTCCTTTGA AACTGGTGAG GCTTAGATTT 600
TTCTAATGGG ATTTTTTACC TGATGATCTA GTTGCATACC CAAATGCTTG TAAATGTTTT 660 
CCTAGTTAAC ATGTTGATAA CTTCGGATTT ACATGTTGTA TATACTTGTC ATCTGTGTTT 720 
CTAGTAAAAA TATATGGCAT TTATAGAAAT ACGTAATTCC TGATTTCCTT TTTTTTTATC 780
TCTATGCTCT GTGTGTACAG GTCAAACAGA CTTCACTCCT ATTTTTATTT ATAGAATTTT 840 
ATATGCAGTC TGTCGTTGGT TCTTGTGTTG TAAGGATACA GCCTTAAATT TCCTAGAGCG 900 
ATGCTCAGTA AGGCGGGTTG TCACATGGGT TCAAATGTAA AACGGGCACG TTTGGCTGCT 960
GCCTTCCCGA GATCCAGGAC ACTAAACTGC TTCTGCACTG AGGTATAAAT CGCTTCAGAT 1020 
CCCAGGGAAG TGCAGATCCA CGTGCATATT CTTAAAGAAG AATGAATACT TTCTAAAATA 1080
TTTTGGCATA GGAAGCAAGC TGCATGGATT TGTTTGGGAC TTAAATTATT TTGGTAACGG 1140 
AGTGCATAGG TTTTAAACAC AGTTGCAGCA TGCTAACGAG TCACAGCGTT TATGCAGAAG 1200
TGATGCCTGG ATGCCTGTTG CAGCTGTTTA CGGCACTGCC TTGCAGTGAG CATTGCAGAT 1260
AGGGGTGGGG TgItTTGTGT CGTGTTCCCA CACGCTGCCA CACAGCCACC TCCCGGAACA 1320
CATCTCACCT GCTGGGTACT TTTCAAACCA TCTTAGCAGT AGTAGATGAG TTACTATGAA 1380
ACAGAGAAGT TCCTCAGTTG GATATTCTCA TGGGATGTCT TTTTTCCCAT GTTGGGCAAA 1440

GTATGATAAA GCATCTCTAT TTGTAAATTA TGCACTTGTT AGTTCCTGAA TCCTTTCTAT 1500

aSSSSt ATTGCAGCAG GTGTAGGCTC TGGTGTGGCC TGTGTCTGTG CTTCAATGTT 1560 
CTAMGCXTC TTTGGAAATA CACTGACTTG ATTGAAGTCT CTTGAAGATA GTAAACAGTA 1620 
CTTACCTTTG ATCCCAATGA AATCGAGCAT TTCAGTTGTA AAAGAATTCC GCCTATTCAT 1680 
ACCATGTAAT GTAATTTTAC ACCCCCAGTG CTGACACTTT GGAATATATT CAAGTAATAG 1740
ACTTTGGCCT CACCCTCTTG TGTACTGTAT TTTGTAATAG AAAATATTTT AAACTGTGCA 1800 
TATGATTATT ACATTATGAA AGAGACATTC TGCTGATCTT CAAATGTAAG AAAATGAGGA 1860
GTGCGTGTGC TTTTATAAAT ACAAGTGATT GCAAATTAGT GCAGGTGTCC TTAAAAAAAA 1920
AAAAAAAAAG TAATATAAAA AGGACCAGGT GTTTTACAAG TGAAATACAT TCCTATTTGG 1980
TAAACAGTTA CATTTTTATG AAGATTACCA GCGCTGCTGA CTTTCTAAAC ATAAGGCTGT 2040
ATTGTCTTCC TGTACCATTG CATTTCCTCA TTCCCAATTT GCACAAGGAT GTCTGGGTAA 2100
ACTATTCAAG AAATGGCTTT GAAATACAGC ATGGGAGCTT GTCTGAGTTG GAATGCAGAG 2160
TTGCACTGCA AAATGTCAGG AAATGGATGT CTCTCAGAAT GCCCAACTCC AAAGGATTTT 2220
ATATGTGTAT ATAGTAAGCA GTTTCCTGAT TCCAGCAGGC CAAAGAGTCT GCTGAATGTT 2280
GTGTTGCCGG AGACCTGTAT TTCTCAACAA GGTAAGATGG TATCCTAGCA ACTGCGGATT 2340
TTAATACATT TTCAGCAGAA GTACTTAGTT AATCTCTACC TTTAGGGATC GTTTCATCAT 2400
TTTTAGATGT TATACTTGAA ATACTGCATA ACTTTTAGCT TTCATGGGTT CCTTTTTTTC 2460
I^SSSE GCAATTTGCT GTCCAACTTT TGTGTTGGTC 

ATAGTAGTTT ACCTTGTATT GAAGAAATAA AGACCATTTT TATATTAAAA AATACTTTTG 2580
™£S5 TCTGATATCC TTGCAGTGCC CATTATGTCA GTTCTGTCAG 2640
ATATTCAGAC ATCAAAACTT AACGTGAGCT CAGTGGAGTT ACAGCTGCGG TTTTGATGCT 2700
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GTTATTATTT CTGAAACTAG AAATGATGTT 
AGAGTGTAAG GCTAGTGAGA AATGCATACA 
TATCAGATTT TTTTTTCATT TGGAAATATA 
TGAAATGCAG TCTGATTGGC ATGAAGAAGC 
TTTGGAATGA AGGAAGTTAA GCAAGGGCAC 
GAGAAAGTGA ACCTGGATTT CTTTGGCTAG 
CCCGATTCCT TGAAAGGGCT CCAGCTTTAA 
GGCCACTGGT TATTTACTGC ATTATGTCTC
TTGAGCATGG ACTATAGCCT GGCTTCAGAG 
GTGCTGGGCT GTGGCTGGGG GGACTGTGGG 
CAGGGAAAAG TGTGGGTAAC TATTTTTAAG
ACGTAGGGTG TGTACTCTCG AAGATTAACA 
ACAGTGGAAG CATTCAAGGG TAGATCATCT 
AAGCGGTATC AGAAGAGCGA GGAAGGTAAG 
GCAGTCTGGG AAAGTAGCAC CCCTTGAGCA 
TAGGAGAACT TTCTTGCTGA ATTCTACTTG 
TTCTGCAGCA CCTGCAAGGC CCAGAGCCTG 
GTCCAAGCTT CAGCAGGTCA TTGTCTTTGC 
AACTGATGTC GAAGCCTCCT GTCCACTACC 
AGAGAGCTAA CTCTATGCCA TAGTCTGAAG 
AGGCAAAACC GGCTGCCCCA TGAGAAGAAA 
AAGCCCCCAG GCAGTGTGAC AGGCCCCTCC 
GCCTAGGGCT CTGCCCGCGA AGTGCGTGTT 
TTGAGATTTA GACACAAGGG AAGCCTGAAA 
AGCCTGTACT TCAAATATAT ATTTTGTGAG 
AAGTTGCAAG AGATTGAAGG CTGAGTAGTT 
GAAACTACTG CTTCTAAACA CTTGTTTGAG 
GTTACATGTC TGATGCACTT GCTTGTCCTT 
CGCATTTGTC ACTTATCCCA TATCTGTCAT
TCAGAAGAAA CAGATGTGAT AATCCCCAGC 
TCTTTCCCTT TTTCCTGCTA AGTAAGGATT 
TCTTCCTGCC TTACATTCTG GGCATTATTT 
TTTGTGTCTT CCTACTCTTA GAGTGAATGC 
GTTGGCCGCA GTTCTCTGAT GAACACACCT 
TCTGAGGAAC GGGCAGCGTT TGCCTCTGAA 
TTGCAACTGA TGGTGGAACT GGTGCTTAAA 
TTCCTTCTTG GCAGTCAGTT TATTTCTGAC 
GAAAGTATGT GGCTCTGCCT GGGTGTGTTA 
CGGGCACCAT TCATCCCAAA CAGGATCCTC 
CTCCAACCTC AAAACATTAA TTGGAGTACG 
TAAGTCATTT AGTCTGGACT CTGCAGCATG 
CACTGATGGA GGAGTAGTAA AAATGGAGAC 
AAGAAACTGA TGGAAATAAT GCATGAATTG 
CTACTTCAAA TGAGGTCGGA GAAGGTCAGT 
CGAGTACCAT TTTTCTCTAC AAGAAAAACG 



GTCTTCATCT GCTCATCAAA CACTTCATGC 2760
TTTATTGATA CTTTTTTAAA GTCAACTTTT 2820
TTGTTTTCTA GACTGCATAG CTTCTGAATC 2880 
ACAGCACTCT TCATCTTACT TAAACTTCAT 2940 
AGGTCCATGA AATAGAGACA GTGCGCTCAG 3000 
TGTTCTAAAT CTGTAGTGAG GAAAGTAACA 3060 
TGCTTCCAAA TTGAAGGTGG CAGGCAACTT 3120 
AGTTTCGCAG CTAACCTGGC TTCTCCACTA 3180
GCCAGGTGAA GGTTGGGATG GGTGGAAGGA 3240 
GACTCCAAGC TGAGCTTGGG GTGGGCAGCA 3300 
TACTGTGTTG CAAACGTCTC ATCTGCAAAT 3360 
GTGTGGGTTC AGTAATATAT GGATGAATTC 3420 
AACGACACCA GATCATCAAG CTATGATTGG 3480
CAGTCTTCAT ATGTTTTCCC TCCACGTAAA 3540 
GAGACAAGGA AATAATTCAG GAGCATGTGC 3600 
CAAGAGCTTT GATGCCTGGC TTCTGGTGCC 3660 
TGGTGAGCTG GAGGGAAAGA TTCTGCTCAA 3720 
TTCTTCCCCC AGCACTGTGC AGCAGAGTGG 3780
TGTTGCTGCA GGCAGACTGC TCTCAGAAAA 3840 
GTAAAATGGG TTTTAAAAAA GAAAACACAA 3900
GCAGTGGTAA ACATGGTAGA AAAGGTGCAG 3960 
TGCCACCTAG AGGCGGGAAC AAGCTTCCCT 4020 
TCTTTGGTGG GTTTTGTTTG GCGTTTGGTT 4080 
GGAGGTGTTG GGCACTATTT TGGTTTGTAA 4140 
GGAGTGTAGC GAATTGGCCA ATTTAAAATA 4200 
GAGAGGGTAA CACGTTTAAT GAGATCTTCT 4260 
TGGTGAGACC TTGGATAGGT GAGTGCTCTT 4320 
TTCCATCCAC ATCCATGCAT TCCACATCCA 4380 
ATCTGACATA CCTGTCTCTT CGTCACTTGG 4440 
CGCCCCAAGT TTGAGAAGAT GGCAGTTGCT 4500 
TTCTCCTGGC TTTGACACCT CACGAAATAG 4560 
CAAATATCTT TGGAGTGCGC TGCTCTCAAG 4620 
TCTTAGAGTG AAAGAGAAGG AAGAGAAGAT 4680 
CTGAATAATG GCCAAAGGTG GGTGGGTTTC 4740 
AGCAAGGAGC TCTGCGGAGT TGCAGTTATT 4800 
GCAGATTCCC TAGGTTCCCT GCTACTTCTT 4860 
AGACAAACAG CCACCCCCAC TGCAGGCTTA 4920 
CAGCTCTGCC CTGGTGAAAG GGGATTAAAA 4980 
ATTCATGGAT CAAGCTGTAA GGAACTTGGG 5040 
AATGTAATTA AAACTGCATT CTCGCATTCC 5100 
TAGGTCGGCA GCTCCCACTT TCTCAAAGAC 5160 
CGATTCAGAA CAACCAACGG AGTGTTGCCG 5220 
TGTGGTGGAC ATTTTTTTTA AATACATAAA 5280 
GTTTTATTAG CAGCCATAAA ACCAGGTGAG 5340 
ATTCTGAGCT CTGCGTAAGT ATAAGTTCTC 5400 



FIG. 3B 



WO 03/024199 



9/31 



PCT/US02/30156 



CATAGCGGCT GAAGCTCCCC CCTGGCTGCC TGCCATCTCA GCTGGAGTGC AGTGCCATTT 5460 
CCTTGGGGTT TCTCTCACAG CAGTAATGGG ACAATACTTC ACAAAAATTC TTTCTTTTCC 5520
TGTCATGTGG GATCCCTACT GTGCCCTCCT GGTTTTACGT TACCCCCTGA CTGTTCCATT 5580 
CAGCGGTTTG GAAAGAGAAA AAGAATTTGG AAATAAAACA TGTCTACGTT ATCACCTCCT 5640
CCAGCATTTT GGTTTTTAAT TATGTCAATA ACTGGCTTAG ATTTGGAAAT GAGAGGGGGT 5700 
TGGGTGTATT ACCGAGGAAC AAAGGAAGGC TTATATAAAC TCAAGTCTTT TATTTAGAGA 5760
ACTGGCAAGC TGTCAAAAAC AAAAAGGCCT TACCACCAAA TTAAGTGAAT AGCCGCTATA 5820 
GCCAGCAGGG CCAGCACGAG GGATGGTGCA CTGCTGGCAC TATGCCACGG CCTGCTTGTG 5880 
ACTCTGAGAG CAACTGCTTT GGAAATGACA GCACTTGGTG CAATTTCCTT TGTTTCAGAA 5940 
TGCGTAGAGC GTGTGCTTGG CGACAGTTTT TCTAGTTAGG CCACTTCTTT TTTCCTTCTC 6000 
TCCTCATTCT CCTAAGCATG TCTCCATGCT GGTAATCCCA GTCAAGTGAA CGTTCAAACA 6060 
ATGAATCCAT CACTGTAGGA TTCTCGTGGT GATCAAATCT TTGTGTGAGG TCTATAAAAT 6120 
ATGGAAGCTT ATTTATTTTT CGTTCTTCCA TATCAGTCTT CTCTATGACA ATTCACATCC 6180
ACCACAGCAA ATTAAAGGTG AAGGAGGCTG GTGGGATGAA GAGGGTCTTC TAGCTTTACG 6240
TTCTTCCTTG CAAGGCCACA GGAAAATGCT GAGAGCTGTA GAATACAGCC TGGGGTAAGA 6300 
AGTTCAGTCT CCTGCTGGGA CAGCTAACCG CATCTTATAA CCCCTTCTGA GACTCATCTT 6360
AGGACCAAAT AGGGTCTATC TGGGGTTTTT GTTCCTGCTG TTCCTCCTGG AAGGCTATCT 6420 
CACTATTTCA CTGCTCCCAC GGTTACAAAC CAAAGATACA GCCTGAATTT TTTCTAGGCC 6480 
ACATTACATA AATTTGACCT GGTACCAATA TTGTTCTCTA TATAGTTATT TCCTTCCCCA 6540 
CTGTGTTTAA CCCCTTAAGG CATTCAGAAC AACTAGAATC ATAGAATGGT TTGGATTGGA 6600 
AGGGGCCTTA AACATCATCC ATTTCCAACC CTCTGCCATG GGCTGCTTGC CACCCACTGG 6660 
CTCAGGCTGC CCAGGGCCCC ATCCAGCCTG GCCTTGAGCA CCTCCAGGGA TGGGGCACCC 6720 
ACAGCTTCTC TGGGCAGCCT GTGCCAACAC CTCACCACTC TCTGGGTAAA GAATTCTCTT 6780 
TTAACATCTA ATCTAAATCT CTTCTCTTTT AGTTTAAAGC CATTCCTCTT TTTCCCGTTG 6840 
CTATCTGTCC AAGAAATGTG TATTGGTCTC CCTCCTGCTT ATAAGCAGGA AGTACTGGAA 6900 
GGCTGCAGTG AGGTCTCCCC ACAGCCTTCT CTTCTCCAGG CTGAACAAGC CCAGCTCCTT 6960 
CAGCCTGTCT TCGTAGGAGA TCATCTTAGT GGCCCTCCTC TGGACCCATT CCAACAGTTC 7020 
CACGGCTTTC TTGTGGAGCC CCAGGTCTGG ATGCAGTACT TCAGATGGGG CCTTACAAAG 7080 
GCAGAGCAGA TGGGGACAAT CGCTTACCCC TCCCTGCTGG CTGCCCCTGT TTTGATGCAG 7140 
CCCAGGGTAC TGTTGGCCTT TCAGGCTCCC AGACCCCTTG CTGATTTGTG TCAAGCTTTT 7200 
CATCCACCAG AACCCACGCT TCCTGGTTAA TACTTCTGCC CTCACTTCTG TAAGCTTGTT 7260 
TCAGGAGACT TCCATTCTTT AGGACAGACT GTGTTACACC TACCTGCCCT ATTCTTGCAT 7320 
ATATACATTT GAGTTCATGT TTCCTGTAAC AGGACAGAAT ATGTATTCCT CTAACAAAAA 7380
TACATGCAGA ATTCCTAGTG CCATCTCAGT AGGGTTTTCA TGGCAGTATT AGCACATAGT 7440 
CAATTTGCTG CAAGTACCTT CCAAGCTGCG GCCTCCCATA AATCCTGTAT TTGGGATCAG 7500 
TTACCTTTTG GGGTAAGCTT TTGTATCTGC AGAGACCCTG GGGGTTCTGA TGTGCTTCAG 7560 
CTCTGCTCTG TTCTGACTGC ACCATTTTCT AGATCACCCA GTTGTTCCTG TACAACTTCC 7620 
TTGTCCTCCA TCCTTTCCCA GCTTGTATCT TTGACAAATA CAGGCCTATT TTTGTGTTTG 7680
CTTCAGCAGC CATTTAATTC TTCAGTGTCA TCTTGTTCTG TTGATGCCAC TGGAACAGGA 7740 
TTTTCAGCAG TCTTGCAAAG AACATCTAGC TGAAAACTTT CTGCCATTCA ATATTCTTAC 7800 
CAGTTCTTCT TGTTTGAGGT GAGCCATAAA TTACTAGAAC TTCGTCACTG ACAAGTTTAT 7860
GCATTTTATT ACTTCTATTA TGTACTTACT TTGACATAAC ACAGACACGC ACATATTTTG 7920 
CTGGGATTTC CACAGTGTCT CTGTGTCCTT CACATGGTTT TACTGTCATA CTTCCGTTAT 7980 
AACCTTGGCA ATCTGCCCAG CTGCCCATCA CAAGAAAAGA GATTCCTTTT TTATTACTTC 8040
TCTTCAGCCA ATAAACAAAA TGTGAGAAGC CCAAACAAGA ACTTGTGGGG CAGGCTGCCA 8100 
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TCAAGGGAGA GACAGCTGAA GGGTTGTGTA GCTCAATAGA ATTAAGAAAT AATAAAGCTG 8160 
TGTCAGACAG TTTTGCCTGA TTTATACAGG CACGCCCCAA GCCAGAGAGG CTGTCTGCCA 8220 
AGGCCACCTT GCAGTCCTTG GTTTGTAAGA TAAGTCATAG GTAACTTTTC TGGTGAATTG 8280 
CGTGGAGAAT CATGATGGCA GTTCTTGCTG TTTACTATGG TAAGATGCTA AAATAGGAGA 8340 
CAGCAAAGTA ACACTTGCTG CTGTAGGTGC TCTGCTATCC AGACAGCGAT GGCACTCGCA 8400 
CACCAAGATG AGGGATGCTC CCAGCTGACG GATGCTGGGG CAGTAACAGT GGGTCCCATG 8460 
CTGCCTGCTC ATTAGCATCA CCTCAGCCCT CACCAGCCCA TCAGAAGGAT CATCCCAAGC 8520 
TGAGGAAAGT TGCTCATCTT CTTCACATCA TCAAACCTTT GGCCTGACTG ATGCCTCCCG 8580 
GATGCTTAAA TGTGGTCACT GACATCTTTA TTTTTCTATG ATTTCAAGTC AGAACCTCCG 8640 
GATCAGGAGG GAACACATAG TGGGAATGTA CCCTCAGCTC CAAGGCCAGA TCTTCCTTCA 8700 
ATGATCATGC ATGCTACTTA GGAAGGTGTG TGTGTGTGAA TGTAGAATTG CCTTTGTTAT 8760 
TTTTTCTTCC TGCTGTCAGG AACATTTTGA ATACCAGAGA AAAAGAAAAG TGCTCTTCTT 8820 
GGCATGGGAG GAGTTGTCAC ACTTGCAAAA TAAAGGATGC AGTCCCAAAT GTTCATAATC 8880 
TCAGGGTCTG AAGGAGGATC AGAAACTGTG TATACAATTT CAGGCTTCTC TGAATGCAGC 8940 
TTTTGAAAGC TGTTCCTGGC CGAGGCAGTA CTAGTCAGAA CCCTCGGAAA CAGGAACAAA 9000
TGTCTTCAAG GTGCAGCAGG AGGAAACACC TTGCCCATCA TGAAAGTGAA TAACCACTGC 9060
CGCTGAAGGA ATCCAGCTCC TGTTTGAGCA GGTGCTGCAC ACTCCCACAC TGAAACAACA 9120 
GTTCATTTT^ ATAGGACTTC CAGGAAGGAT CTTCTTCTTA AGCTTCTTAA TTATGGTACA 9180 
TCTCCAGTTG GCAGATGACT ATGACTACTG ACAGGAGAAT GAGGAACTAG CTGGGAATAT 9240 
TTCTGTTTGA CCACCATGGA GTCACCCATT TCTTTACTGG TATTTGGAAA TAATAATTCT 9300
GAATTGCAAA GCAGGAGTTA GCGAAGATCT TCATTTCTTC CATGTTGGTG ACAGCACAGT 9360
TCTGGCTATG AAAGTCTGCT TACAAGGAAG AGGATAAAAA TCATAGGGAT AATAAATCTA 9420 
AGTTTGAAGA CAATGAGGTT TTAGCTGCAT TTGACATGAA GAAATTGAGA CCTCTACTGG 9480 
ATAGCTATGG TATTTACGTG TCTTTTTGCT TAGTTACTTA TTGACCCCAG CTGAGGTCAA 9540 
GTATGAACTC AGGTCTCTCG GGCTACTGGC ATGGATTGAT TACATACAAC TGTAATTTTA 9600 
GCAGTGATTT AGGGTTTATG AGTACTTTTG CAGTAAATCA TAGGGTTAGT AATGTTAATC 9660
TCAGGGAAAA AAAAAAAAAG CCAACCCTGA CAGACATCCC AGCTCAGGTG GAAATCAAGG 9720
ATCACAGCTC AGTGCGGTCC CAGAGAACAC AGGGACTCTT CTCTTAGGAC CTTTATGTAC 9780
AGGGCCTCAA GATAACTGAT GTTAGTGAGA AGACTTTCCA TTCTGGCCAC AGTTCAGCTG 9840
AGGCAATCCT GGAATTTTCT CTCCGCTGCA CAGTTCCAGT CATCCCAGTT TGTACAGTTC 9900 
TGGCACTTTT TGGGTCAGGC CGTGATCCAA GGAGCAGAAG TTCCAGCTAT GGTCAGGGAG 9960 
TGCCTGACCG TCCCAACTCA CTGCACTCAA ACAAAGGCGA AACCACAAGA GTGGCTTTTG 10020
TTGAAATTGC AGTGTGGCCC AGAGGGGCTG CACCAGTACT GGATTGACCA GGAGGCAACA 10080 
TTAATCCTCA GCAAGTGCAA TTTGCAGCCA TTAAATTGAA CTAACTGATA CTACAATGCA 10140
ATCAGTATCA ACAAGTGGTT TGGCTTGGAA GATGGAGTCT AGGGGCTCTA CAGGAGTAGC 10200
TACTCTCTAA TGGAGTTGCA TTTTGAAGCA GGACACTGTG AAAAGCTGGC CTGCTAAAGA 10260 
GGCTGCTAAA CATTAGGGTC AATTTTCCAG TGCACTTTCT GAAGTGTCTG CAGTTCCCCA 10320 
TGCAAAGCTG CCCAAACATA GCACTTCCAA TTGAATACAA TTATATGCAG GCGTACTGCT 10380
TCTTGCCAGC ACTGTCCTTC TCAAATGAAC TCAACAAACA ATTTCAAAGT CTAGTAGAAA 10440
GTAACAAGCT TTGAATGTCA TTAAAAAGTA TATCTGCTTT CAGTAGTTCA GCTTATTTAT 10500
GCCCACTAGA AACATCTTGT ACAAGCTGAA CACTGGGGCT CCAGATTAGT GGTAAAACCT 10560
ACTTTATACA ATCATAGAAT CATAGAATGG CCTGGGTTGG AAGGGACCCC AAGGATCATG 10620 
AAGATCCAAC AGCCCCGCCA CAGGCAGGGC CACCAACCTC CAGATCTGGT ACTAGACCAG 10680 
GCAGCCCAGG GCTCCATCCA ACCTGGCCAT GAACACCTCC AGGGATGGAG CATCCACAAC 10740 
CTCTCTGGGC AGCCTGTGCC AGCACCTCAC CACCCTCTCT GTGAAGAACT TTTCCCTGAC 10800 
ATCCAATCTA AGCCTTCCCT CCTTGAGGTT AGATCCACTC CCCCTTGTGC TATCACTGTC 10860 
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TACTCTTGTA AAAAGTTGAT TCTCCTCCTT 
GCCTTCTTCT CTTCTGCAGG ATGAACAAGC 
GGTGCTCCAG CCCTCTGATC ATCTTTGTGG 
CATCTTTCCT GTACTGGGGG CCCCAGGCCT
GAGCAGAGTA AAGAGGGACA ATCACCTTCC 
AGCCCTGGAT ACAACTGGCT TTCTGAGCTG 
ACAGGAACAA TACAACAGGT GCTGATGGCC 
GGTAGATCTT AGATGAGGAA CGTTGAAGTT 
ATACTCCTGC CTGATACCTC ACCCCACCTG 
CAGGGCCCTG ATGAACCCGG CACTGCTTCA 
TTGCACCTAT GAATACACAA ACAATGTGTT 
AATTTGCATT GTCAGGAAAT GGTTTAGTAA 
TGGCTGTTTT TATGGCTGTT AGTAGTGGTA 
AATCAAGACT GTAGATATTG CAACAGACTA 
TACTTCCCAC ATTGTATAAG AAATTTGGCA 
ATTTCTGTAT ACTCAAGAGG GCGTTTTTGA 
TGGGAGGAAG TTAAAAGAAG AGGCAGGTGC 
ACACTGGCAA CATGAGGTCT TTGCTAATCT 
TAGGG 11945 



TTTGGAAGGT TGCAATGAGG TCTCCTTGCA 10920 
CCAGCTCCCT CAGCCTGTCT TTATAGGAGA 10980 
CCCTCCTCTG GACCCGCTCC AAGAGCTCCA 11040
GAATGCAGTA CTCCAGATGG GGCCTCAAAA 11100 
TCACCCTGCT GGCCAGCCCT CTTCTGATGG 11160 
CAACTTCTCC TTATGAGTTC CACTATTAAA 11220 
AGTGCAGAGT TTTTCACACT TCTTCATTTC 11280 
GTGCTTCTGC GTGTGCTTCT TCCTCCTCAA 11340 
CCACTGAATG GCTCCATGGC CCCCTGCAGC 11400 
GATGCTGTTT AATAGCACAG TATGACCAAG 11460 
GCATCCTTCA GCACTTGAGA AGAAGAGCCA 11520 
TTCTGCCAAT TAAAACTTGT TTATCTACCA 11580 
CACTGATGAT GAACAATGGC TATGCAGTAA 11640 
TAAAATTCCT CTGTGGCTTA GCCAATGTGG 11700 
AGTTTAGAGC AATGTTTGAA GTGTTGGGAA 11760 
CAACTGTAGA ACAGAGGAAT CAAAAGGGGG 11820 
AAGAGAGCTT GCAGTCCCGC TGTGTGTACG 11880 
TGGTGCTTTG CTTCCTGCCC CTGGCTGCCT 11940 
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SEQIDNO:8 




ScSatgtggtaaaatcgataaggatccgtosagcggccgc 285 
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SEQ ID NO: 9 



200 



400 



600 



800



GGCGACAvJ A tvjCv3VjwAuv^^« wwwwwww 

CAACCGTCGCCCCGTGACGGCACCGCCCCGCCCCCGTGACGCC3GTGCGGG. 
CGCCGGGGeCGTGGGGCTGAGCGCTGCGGCGGGGCCGGGCCGGGCCGGGG 
CGGGAGCTGAGCGCGGCGCGGCTGCGGGCGGCGCCCCCTCCGGTGCAATA 
TGTTCAAGAGAATGGCTGAGTTCGGGCCTGACTCCGC-GGGCAGGGTGAAG 
GTGCGGCGCGGGCGGAGGGACGGGGCGGGCGCGGGGCCGCCCGGCGGGTG 
CCGGGGCCTCTGCCGGCCCGCCCGGCTCGGGCTGCTGCGGCGCTTACGGG 



TCTCGTAGGAC^TGTCCGCCTACGTGAAAAAAATCCaGTrCAAGCTGCAC 
' GAGAGCTACGGGAATCCTCTCCGAGGTGGGTGTTGCGTCGGGGGGTTTGC 
1000 ^JcGcSGGTCCCGCTaAC^CGTCC^CCTCATCTTTCTTTCGTGCCGC 

TTGAAATCATCATCA^GATATTTTTCATTGATCCAAACGAGCGACCCGTA 
AGTACGCTCAGCTTCTCGTAGTGCTTCCCCCGTCCTGGCGGCCCGGGGCT 

i 2 oo SSgctcgctgctgccggtcacagtcccgccagccgcggagctgactg 

AGCTCCCTTTCCCGGGACGTGTGCTCTGTGTTCGGTCAGCGAGGCTATCG 
GGAGGGCTTTGGCTGCATTTGGCTTCTCTGGCGCTTAGCGCAGGAGCACG 
SGSScG^aAACTACAGCTGTGAGAAGGCCGTC^CCG^ 

GGAAATGCTTT AG AGAAGGTCT CTGTGGTAGTT CTT A i GCATCT ATC CT A 
AAGCACTTGGCCAGACAATTTAAAGACATCAAGCAGCATTTATAGCAGGC 

a^otSScgaatactgawttaagtaactctgctcacgttgtatga 

1600 ^^^CTGAAAGCCA^AA^^-^T^A 
GTAAGAACAGCTGCCACTGTTTTGTATCTAGGAGATA.-^ i GGTGTTTCCC 

?a^ctcaagctgataaaactctgtctttgtatctaggtaaccctgt 
ScaSStgaI^ 

TCAACCGAGCCTa^CTTTATTTAAAAAAAATTATTGA.GGTGCTGTGTAT 
TTTGGTCCTTCCCTAGATATTTCAAGATCCTACTGCCAXGATGCAGOVAC 
TGCraACG^GTttOTTCAGCTGACACTTGGTGCTTACAAGCATGAAAtt 
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2000 GAGTGTAAGTGCAAAATGAGGATACCTTCGCCGACCGTCATTCACTACTA 
ATGTTTTCTGTGGGATGTGATCGTACAGTGAGTTTGGCTGTGTGAAATTT 
GAATAGCTTGGTATTGGCAGTGAtGACGTGATCGATGCCTTGCTTATCAT 
GTTTGAAATGAAGTAGAATAAATGCAGCCTGCTTTATXTGAGATAGTTTG 

2200 GTTCATTTTATGGAATGCAAGCAAAGATT ATACTTCCTCACTGAATTGCA 
CTGTCCAAAGGTGTGAAATGTGTGGGGATCTGGAGGACCGTGACCGAGGG 
ACAT T GGATCGCTATCTCCCATTTCTTTTGCTGTTACCAGTTCAGATTTT 
CTTTTCACCTAGTCTTTAATTCCCAGGGTTTTGTTTTTTCCTTGGTCATA 

2400 GTXTTTGTTTTTCACTCTGGCAAATGATGTTGTGAATTACACTGCTTCAG 
CCACAAAACTGATGGACTGAATGAGGTCATCAAACAAACTTTTCTTCTTC 
CGTATTTCCTTTTTTTTCCCCCACTTATGATTTTTACTGCTGTTGTTGAG 
TCTG^AAGGCTAAAAGTAACTGTTTTGTGCTTTTTCAGGACGTGTGCTTT 

2600 CCAAATTACTGCCAeATATATAAAGAAAGGTTGGAATTTTAAAGATAATT 
CATGTTTCTTCTTCTTTTTTGCCACCACA.GTTGCAGATCTTGAAGTARAA. 
ACCAGGGAAAAGCTGGAAGCTGCCAAAAAGAAAACCAGTTTTGAAATTGC 
TGAGCTTAAA.GAAAGGTTAAAAGCAAGTCGTGAAACCATCAACTGCTTAA 

2800 AGAGTGAAATCAGAAAACTCGAAGAGGATGATCAGTCTAAAGATATGTGA 
TGAGTGTTGACTTGGCAGGGAGCCTATAATGAGAATGAAAGGACTTCAGT 
CGTGGAGTTGTATGCGTTCTCTCCAATTCTGTAACGGAGACTGTATGAAT 
TTC aT TTGCAA ! VTCACTGCAGTGTGTGACAACTGACTTTTTATAAATGGC 

3000 AGAAAACAAGAATG AATGTAT C CT CATTTT ATAGTTAAAATCT ATGGGT A 
TGTACTGGTTTATTTCAAGGAGAATGGATCGTAGAGACTTGGAGGCCAGA 
TTGC^GCTTGTATTGACTGCATTTGAGTGGTGTAGGAACATTTTGTCTAT 
GGTCCCGTGTTAGTTTACAGAATGCCACTGTTCACTGTTTTGTTTTGTAT 

3200 TTTACTTTTTCTACTGCAACGTCAAGGTTTTAAAAGTTGAAWVTAAAACA 
TGCAGGTTTTTTTTAAAT ATTTTTTTGT CTCT AT CCAGTTTGGGCTT CAA 
GTATTATTGTTAACAGCAAGTCCTGATTTAAGTCAGAGGCTGAAGTGTAA 
TGGTATTCAAGATGCTTAAGTCTGTTGTCAGCAAAACAAAAGAGAAAACT 

3 400 TCAXVAAATCAGGAAGTTGGCATTTCTAATAACTTCTTTATCAACAGATA 
• AGAGTTTCTAGCCCTGCATCTACTTTCACTTATGTAGTTGATGCCTTTAT 
ATTTTGTGTGTTTGGATGCAGGAAGTGATTCCTACTCTGTTATGTAGATA 
TTCTATTTAACACTTGTACTCTGCTGTGCTTAGCC'TTTCCCCATGAAAAT 

3S00 TCAGCGGCTGTAAATCCCCCTCTTCTrTTGTAGGCTCATACAGATGGCAG 
ACCCTCAGGCTTATAAAGGCTTGGGCATCTTCTTTACTGCTTTGAGATTC 
TGTG^TGCAGTAACCTCTGCCAGAGAGGAGAAAAGCCCCACAAACCTCAT 
CCCC^TCTTCTATAGCAATCAGTATTACTAATGCTTTGAGA»lCAGAGCAC 

3800 TGGTTTGAAACGTTTGATAATTAGC^TrTAACATGGCTTGGTAAAGATGC 
AGA a CTGAAACAGCTGTGACAGTATGAACTCAGTATGGAGACTTCATTAA 
GACAAA.CAGCTGTTAAAATCAGGCATGTTTCATTGAGGAGGACGGGGCAA 
CTTGC^CCAGTGGTGCCCACACAAATCCTTCCTGGCGCTGCAGACCAATT 
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4 00 0 TTTCTGGCATTCTGACTGCCGTTGCTGCTGGTCACAGAGAGCAACTATTT 
TTATCAGCCACAGGCAATTTGCTTGTAGTATTTTCCAAGTGTTGTAGGTA 
AGTATAAATGCATCGGCTCCAGAGCACTTTGAGTATACTTATTAAAAACA 
TAAATGAASGACAAATTAGCTTTGCTTGGGTGCACAGAACArrTTTAGTT 

4200 CCAGCCTGCTTTTTGGTAGAAGCCCTCTTCTGAGGCTAGAACTGACTTTG 
ACAAGTAGAGAAACTGGCAACGGAGCTATTGCTATCGAAGGATCCTTGTT 
AACAAAGTTAATCGTCTTTTAAGGTTTGGTTTATTCATTAAATTTGCTTT 
TAAGCTGTAGCTGAAAAAGAACGTGCTGTCTTCCATGCACCAGGTGGCAG 

4400 CTCTGTGCAAAGTGCTCTCTGGTCTCACCAGCCTTTTAATTGCCGGGATT
CTGGCACGTCTGAGAGGGCTCAGACTGGCTTCGTTTGTTTGAACAGCGTG 
TACTGCTTTCTGTAGACATGGCCGGTTTCTCTCCTGCAGCTTATGAAACT 
GTTCACACTGAACACACTGGAACAGGTTGCCCAAGGAGGCCGTGGATGCC 

4600 CCATCCCTGGAGGCATTCAAGGCCAGGCTGGATGTGGCTCTGGGCAGCCT 

^Sggtggt^cgatcctgcagatagcagcggggttgaaactcgatg 

ATCACTCTGGTCCTTTTCAACCCAGGCTATTCTATGATTCTATGATTCAA 

SgStgatatgtactgagagaggaaacaaacacaagtgctactgttt 
4300 g^mtcttgttcatttggtaaaagagtcaggttttaaaattcaaaatct 

G^G^TGGSTTTTTrrTTTTTTATTTATTATTTCTTTGGGGTTCT 

S^gatgcSatctttctctgccaggactgtgtgacaatgggaacgaa 

sooo SsSSSS3SS5!SSSSSSSSSS 
====== 

5200 ===== 

A^GGAASxCT^^GCGTTCACrTATGCTACATTCATAGTATTTCCAT 

CACCGCCTTCCTATGCACCTGACCAACTTCCAGAGGAAAAGCCTATTGAA
5400 A^CGAGMT^^^CCAAAA.GA^CTCATTTGCATTGGAATATGTAGTA 

5400 ^^^^^^^^^^l 

GTTAAATGAGTGGCTGGCACTTTTTATTCTCACAGCTGTGGGGAATTCTG 
TCCTCTMG ACAGAAACAATTTT AAT CTGTT C CACTGGTGACTGCTTTGT 

ssoo SS^^^^S 
^g^gS^ccc^cctaaagctcaattcmotcc^c^ 



TCAACTCTCTATAGCTAACATGAAGAATCTTCAAAAGTTAGGTCTGAGGG 
ACTTAAGGCTAACTGTAGATGTTGTTGCCTGGTTTCTGTGCTGAAGGCCG 



5800 

TGTAGTAGTTAGAGCATTCAACCTCTAG 



FIG. 5C 



WO 03/024199 






PCT/US02/30156 



SEQ ID NO: 10 



1 TGCCGCCTTCTTTGATATTCACTCTGTTGTATTTCATCTCTTCTTGCCGA
TGARAGGATATAACAGTCTGTATAACAGTCTGTGAGGAAATACTTGGTAT
TTCTTCTGATCAGTGTTTTTATAAGTAATGTTGAATATTGGATAAGGCTG
151 TGTGTCCTTTGTCTTGGGAGACAAAGCCCACAGCAGGTGGTGGTTGGGGT
GGTGGCAGCTCAGTGACAGGAGAGGTTTTTTTGCCTGTTTTTTTTTTTTT 
TTTTTTTTTTAAGTAAGGTGTTCTTTTTTCTTAGTAAATTTTCTACTGGA 

301 CTGTATGTTTTGACAGGTCAGAAACATTTCTTCAAAAGAAGAACCTTTTG
GAAACTGTACAGCCCTTTTCTTTCATTCCCTTTTTGCTTTCTGTGCCAAT
GCCTTTGGTTCTGATTGCATTATGGAAAACGTTGATCGGAACTTGAGGTT
451 TTTATTTATAGTGTGGCTTGAAAGCTTGGATAGCTGTTGTTACACGAGAT
ACCTTATTAAGTTTAGGCCAGCTTGATGCTTTATTTTTTCCCTTTGAAGT
AGTGAGCGTTCTCTGGTTTTTTTCCTTTGAAACTGGTGAGGCTTAGATTT
601 TTCTAATGGGATTTTTTACCTGATGATCTAGTTGCATACCCAAATGCTTG
TAAATCTTTTCCTAGTTAACATGTTGATAACTTCGGATTTACATGTTGTA
TATACTTGTCATCTGTGTTTCTAGTAAAAATATATGGCATTTATAGAAAT
751 ACGTAATTCCTGATTTCCTTTTTTTTTATCTCTATGCTCTGTGTGTACAG
GTCAAACAGACTTCACTCCTATTTTTATTTATAGAATTTTATATGCAGTC
TGTCGTTGGTTCTTGTGTTGTAAGGATACAGCCTTAAATTTCCTAGAGCG
901 ATGCTCAGTAAGGCGCGTTGTCACATGGGTTCAAATGTAAAACGGGCACG
TTTGGCTGCTGCCTTCCCGAGATCCAGGACACTAAACTGCTTCTGCACTG
AGGTATAAATCGCTTCAGATCCCAGGGAAGTGCAGATCCACGTGCATATT
1051 CTTAAAGAAGAATGAATACTTTCTAAAATATTTTGGCATAGGAAGCAAGC
TGCATGGATTTGTTTGGGACTTAAATTATTTTGGTAACGGAGTGCATAGG
TTTTAAACACAGTTGCAGCATGCTAACGAGTCACAGCGTTTATGCAGAAG
1201 x^TGCCTGGATGCCTGTTGCAGCTGTTTACGGCACTGCCTTGCAGTGnG 

SSgata^ 

1351 TCTT A6 CAGT AGT AG ATG AGTT ACT AT G AAACAGAGAAGTT C CT CAG i iG 
GATATTCTCATCGGATGTCTTTITTCCCATGTTCGGCAAflGTATGATAAA 
G^ScTATTTGTAAATTATGCACTTGTTAGTTCCTGAATCCTTTCTAT 

1501 SSSStATTM^ 

CXT^ftA^CTTCTAAAiGCTTCTTTGGAAATACACTGACTTGATTGAAGTCT 

SSa^Saaacactacttacctttga 

1651 S^TSA^TTCCGCCTATTCAXACCATGTAATGTAATmAC 
ScrcS^roCT^CACTTTGGAATATATTCAAGTAATAGACTTTGKCT 

caccctcttgtgtactgtattttgtaatagaaaatattttaaactgtgca 
1801 tatgmtattacattatgaaagagacattctgctgatcttcaaatc 

SSg^t^aaaaaaaaaaaaaaaaagtaatataaa^ 
1951 cttttacaagtgaaatacatt cct atttggtaaacagtt acatttttatg 

SSSS^GCTCaCTTTCTAAACATAAGCK^TATTCTOTCC 
T^ic^T^TTTCCTCATTGCCAATTTGCACAAGGATGTCTGGGTAA 
2101 ACTATT^AAGAAATGGCTTTGAAATACAGCATGGGAGCTTGTCT 

gStcca^agttgcactccaaaatgtcaggaaatggatgtctctcagaat 
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2251 TCCAGCAGGCCAAAGAGTCTGCTGAATGTTGTGTTGCCGGAGACCTGTAT 
TTCTCAACAAGGTAAGATGGTATCCTAGCAACTGCGGATTTTAATACATT 
TTCAGCAGAAGTACTTAGTTAATCTCTACCTTTAGGGATCGTTTCATCAT 

2401 TTTTAGATGTTATACTTGAAATACTGCATAACTTTTAGCTTTCATGGGTT

TGTGTTGGTCTTAAACTGCAATAGTAGTTTACCTTGTATTGAAGAAATAA

2551 AGACCATTTTTATATTAAAAAATACTTTTGTCTGTCTTCATTTTGACTTG
TCTGATATCCTTGCAGTGCCCATTATGTCAGTTCTGTCAGATATTCAGAC
ATCAAAACTTAACGTGAGCTCAGTGGAGTTACAGCTGCGGTTTTGATGCT

2701 GTTATTATTTCTGAAACTAGAAATGATGTTGTCTTCATCTGCTCATCAAA
CACTTCATGCAGAGTGTAAGGCTAGTGAGAAATGC^TACATTTATTGATA 
GTTTTTTAAAGTCAACTTTTTATCAGATTTTTTTTTCATTTK 

■2851 TTGTTTTCTAGACTGCATAGCrrCTGAATCTGAAATGCaGTCTGATTGGC 
ATGA^GAAGCACAGCACT CTT CAT CTTACTTAAACTT CATTTTGGAATGA 
AGGAAGTTAAGCAAGGGCACAGGTCCATGAAATAGAGAC^GTGCGCTCAG 

3001 GAGAAAGTGAACCTGGATTTCTTTGGCTAGTGTTCTAAATCTGTAGTGAG 
GAAAGTAACACCCGATTCCTTGAAAGGGCTCCAGCTTTAATGCTTCCAAA 
TTGAAGGTGGCAGGCAACTTGGCCACTGGTTATTTACTGCATTATGTCTC 

3151 AGTTTCGCAGCTAACCTGGCTTCTCCACTATTGAGCATGGACTATAGCCT 
GGCTTCAGAGGCCAGGTGAAGGTTGGGATGGGTGGAAGGAGTGCTGGGCT 
GTGGCTGGGGGGACTGTGGGG ACT C CAAGCTGAGCTT GGGGTGGGCAGCA 

3301 CAGGGAAAAGTGTGGGTAACTATTTTTAAGTACTGTGTTGCAA^CGTCTC 
ATCTGCAAATACGTAGGGTGTGTACTCTCGAAGATTAA.C a .GTGTGGGTTC 
AGTAATATATGGATGAATTCAC^GTGGAAGCATTCA a .GGGTAGATCATCT 

3451 AACGACACCAGATCATCAAGCTATGATTGGAAGCGGTATCAG^-GAGCGA 
GGAAGGTAAG CAGT CTTCATATGTTTTC CCT C GA.CGT AAAGC\GTCTGGG 
AAAGTAGCACCCCTTGAGCAGAGACAAGGAAATAATTCAGGAGCATGTGC 

3601 TAGGAGAACTTTCTTGCTGAATTCTACTTGCAAGAGCTTTGATGCCXGGC 
TTCTGGTGCCTTCTGCAGCACCTGCAAGGCCCAGAGCCTGTGGTGAGCTG 
GAGGGAAAGATTCTGCTCAAGTCCAAGCTTCAGCAGGTCATTGTCTTTGC 

3751 TTCTTCCCCCAGCACTGTGCAGCAGAGTGGAACTGATGTCGAhGCCTCCT 
GTCCACTACCTGTTGCTGCAjGGCAGACTGCTCTCAGAAAAAGAGAGCTAA 

ctctatgccatagtctgaaggtaaaatgggtttta^ 

3901 AGGCAAAACCGGGTGCCCCATGAGAAGAAAGCAGTGGTAAAC^TGGTAGA 
AAAGGTGCAGAAGCCCCCAGGCAGTGTGACAGGCCCCTCCTGCCACCTAG 
AGGCGGGy^CAAGCTTCCCTGCCTAGGGCrCTGCCCGCGAAGXGCGTGTT 

4051 TCTTTGGTGGGTTTTGTTTGGCGTTTGGTTTTGAGATTTAGACACAAGGG 
AAGCCTGAAAGGAGGTGTTGGGCACTATTTTGGTTTGTAAAGCCTGTACT 
TCAAATATATATTTTGTGAGGGAGTGTAGCGAATTGGCCAATTTAAAATA 

4201 AAGTTGCAAGAGATTGAAGGCTGAGTAGTTGAGAGGGTAACACGTTTAAT 
GAGATCTTCTGAAACTACTGCTTCTAAACACTTGTTTGAGTGGTGAGACC 
TTGGATAGGTGAGTGCTCTTGTTACAXGTCTGATGCACTTGCTTGTCCTT 

4351 TTCCATCCACATCCATGCATTCCACATCCACGCATTTGTCACTTATCCCA 
TATCTGTCATATCTGACATACCTGTGTCTTCGTCACTTGGTCAGAAGAAA 
CAGATGTGATAATCCCCAGCCGCCCCAAGTTTGAGAAGATGGCAGTTGCT 

4501 TCTTTCCCTTTTTCCTGCTAAGTAAGGATTTTCTCCTGGCTTTGACACCT 
CACGAAATAGT CTT C CTGCCTT ACATT CTGGG C ATTATTTCAAATATCTT 
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TGGAGTGCGCTGCTCTCAAGTTTGTGTCTTCCTACTCTTAGAGTGAATGC 

4651 T CTT AGAGTGAAAGAGAAGGAAGAGAAGATG7TGGCCGCAGTT CT CTGAT 
GAACAC AC CTCT GAAT AATGGC CAAAGGTGGGTGGGTTT C7CTGAGGAAC 
GGGCAGCGTTTGCCTCTGAAAGCAAGGAGCTCTGCGGAGTTGCAGTTATT 

4801 TTGCAACTGATOTTGGAACTGGTGCTTAAAGCAGATTCCCT AGGTTCCCT 
GCTACTTCTTTTCCTTCTTGGCAGTCAGTTTATTTCTGACAGACAAACAG 
C CAC C C CCACTGCAGGCTT AGAAAGTATGTGG CT CTGC CTGGGTGTGTT A 

4951 CAGCTCTGCCCTGGTGAAAGGGGATTAAAACGGGCACCATTCATCCCAAA 
CAGGAT CCT CATTCATGGAT CAAGCTGTAAGGA^CTTGGGCT C CAACCT C 
AAAACATTAATTGGAGTACGAATGTAATTAAAACTGCATTCTCGCATTCC 

5101 TAAGTCATTTAGTCTGGACTCTGCAGCATGTAGGTCGGC^GCTCCCACTT 
T CTCAAAGACCACTGATGGAGGAGTAGTAAA^-ATGGAGAC CGATTCAGAA 
CAACCAACGGAGTGTTGCCGAAGAAACTGATGGAAATAATGCATGAATTG 

5251 TGTGGTGGACATTTTTTTTAAATACATAAACTACTTCAAATGAGGTCGGA 
GAAGGTCAGTGTTTTATTAGCAGCC^TAA^CCAGGTGAGCGAGTACCAT 
TTTTCTCTACAAGAAAAACGATTCTGAGCTCTGCGTAAGTATAAGTTCTC 

5401 CATAGCGGCTGAAGCTCCCCCCTGGCTGCCTGCCATCTCAGCTGGAGTGC 
AGTGCCATTTCCTTGGGGTTTCTCTCACAGC^GTAATGGGACAATACTTC 
ACAAAAATTCTTTCTTTTCCTGTCATGTGGGATCCCTACTGTGCCCTCCT 

5551 GGTTTTACGTTAC CCC CTGACTGTT C CATT CAGCGGTTTGGAAAGAGAAA 
AAGAATTTGGAAATAAAACATGTCTACGTTATCACCTCCTCCAGCATTTT 
GGTTTTTAATTATGTCAATAACTGGCTTAGATTTGGAAATGAGAGGGGGT 

5701 TGGGTGTATTACCGAGGAACAAAGGAAGGCTTATATAAACTCAAGTCTTT 
TATTTAGAGAACTGGCAAGCTGTCAAAAACAAAAAGGCCTTACC^ 
TTAAGTGAATAGCCGCTATAGCCAGCAGGGCCAGCACGAGGGATGGTGCA 

5851 CTGCTGGCACTATGCCACGGCCTGCTTGTGACTCTGAGAGCAACTGCTTT 
GGAAATGACAGCACTTGGTGCAATT7CCTTTGTTTCAGAATGCGTAGAGC 
GTGTGCTTGGCGACAGTTTTTCTAGTTAGGCCACTTCTTTTTTCCTTCTC 

6001 TCCTCATTCTCCTAAGCATGTCTCCATGCTG3TAATCCCAGTCAAGTGAA 
CGTTCAAACAATGAATCCATCACTGTAGGATTCTCGTGGTGATCAAATCT 
TTGTGTGAGGTCTATAAAATATGGAAGCTTATTTATTTTTCGTTCTTCCA 

6151 TATCAGTCTTCTCTATGACAATTCACATCCACCACAGCAJLArTAAAGGTG 
AAGGAGGCTGGTGGGATGAAGAGGGTCTTCTAGCrTTACGrTCTTCCTTG 
CAAGGCCACAGGAAAATGCTGAGAGCTGTAGAATACAGCCTGGGGTAAGA 

6301 AGTTCAGTCTCCTGCTGGGACAGCTAACCGC-.TCTTATAACCCCTTCTGA 
GACTCATCTTAGGACCAAATAGGGTCTATCTGGGXjTTTTTGTTCCTGCTG 
TTCCTCCTGGAAGGCTATCTCACTATTTCACiGCTCCCACGGTTACAAAC 

6451 GAAAGATACAGCCTGAATTTTTTCTAGGCC^C^TTACATAAATTTGACCT 
GGTACCAATATTCTTCTCTATATAGTTATTTCCTTCCCCACTGTGTTTAA 
CCCCTTAAGGCATTCAGAACAACTAGAATCATAGAATGGTTTGGATTGGA 

6601 AGGGGCCTTAAACATCATCCATTTCCAACCC7CTGCCAT&GGCTGCTTGC 
CACCCACTGGCTCAGGCTGCCCAGGGCCCCaTCCAGCCTGGCCTTGAGCA 
CCTCCAGGGATGGGGCACCCACAGCTTCTCTGGGCAGCCTGTGCCAACAC 

6751 CTCACCACTCTCTGGGTAAAGAATTCTCTTTTAACATCTAATCTAAATCT 
CTTCTCTTTTAGTTTAAAGCCATTCCTCTTTTTCCCGTTGCTATCTGTCC 
AAGAAATGTGTATTGGTCTCCCTCCTGCTTATAAGCAGGAAGTACTGGAA 

6901 GGCTGCAGTGAGGTCTCCCCACAGCCTTCTCTTCTCCAGGCTGAACAAGC 
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C CAG CTC CTT CAGCCTGTCT T CGI AC-GAG AT CAT CTTAGTGG C C CT CCT C 
TGG ACC CATT CCAACAGTTC CACGG C7TT CTT GTGGAGC C C CAGGT CTGG 

7051 AXGCAGXACXXCAGATGGGGCCTTACAAAGGCAGAGCAGAXGGGGACAAT 
CGCXXACCCCXCCCTGCTGGCTGCCCCXGXXXXGAXGCAGCCCAGGGTAC 
TGTTGGCCTTTCAGGCTCCCAGACCCCTTGCTGATTTGTGTCAAGCTTTT 

7201 CATCCACCAGAACCCACGCTTCCTGGTTAATACTTCTGCCCTCACTTCTG 
' TAAGCTTGTTTCAGGAGACTTCCA.TTCTTTAGGACAGACTGTGTTACACC 
XACCXGCCCXATXCXXGCAXATATACAXTXCAGXXCAXGTXTCCXGXAAC 

7351 AGGACAGAATATGTATTCCTCTAACAAAAATACATGCAGAATTCCTAGTG 
CCAT CT CAGT AGGGTTTTCATGGCAGT ATT AGC ACAT AGT CAATTTGCTG 
CAAGTACCTTCCAAGCTGCGGCC7CCCATAAATCCTGTATTTGGGATCAG 

7501 TTACCTTTTGGGGTAAGCTTTTGTATCTGCAGAGACCCTGGGGGTTCTGA 
TGTGCTT CAGCTCTGCTCTGTTCTGACTGCAC CATTTTCTAGATCACCCA 
GTTGTT CCTGTACAACTT CCT TGI CCTCCAT C CTTT CCCAGCTTGT ATCT 

7651 TTGAGAAATACAGGCCTATTTTTGTGTTTGCTTCAGCAGCCATTTAATTC 
TTCAGTGTCATCTTGTTCTGTTGATGCCACTGGAACAGGATTTTCAGCAG 
TCTTGCAAAGAACATCTAGCTGAAA a .CTTrCTGCCATT 

7801 CAGTT CTT CTTGXXTGAGGTGAGCCVTAAATT ACTAGAACXT CGTCACTG 
ACAAGTTTATGCATTTTATTACITCTATTATGTACTTACTTTGACATAAC 
ACAGACACGCACATATTTTGCTGvSGATTTCCACAGTGTCTCTGTGTCCTT 

7951 *" CACATGGTTTTACTGT CAT AC77CCGTTATAAC CTTGGCAAT CTGCCCAG 

• CTGC C CAT CACAAGAAAAG AGATT CCTTTTTT ATTACTTCT CTTCAGCCA 
ATAAACAAAATGTGAGAAGCCCAAACAAGAACTTGTGGGGCAGGCTGCCA 

8101 TCAAGGGAGAGACAGCTGAAGGGTTGTGTAGCTCAATAGAATTAAGAAAT 
AATAAAGCTGTGTCAGACAGTTTTGCCTGATTTATACAGGCACGCCCCAA 
GCCAGAGAGGCTGTCTGCCAAG3CCACCTTGCAGTCCTTGGTTTGTAAGA 

8251 XAAGTCAXAGGXAACTXTTCTGG7GAATTGCGTGGAGAAXCAXGAXGGCA 
GTTCTTGCTGTTTACTATGGTA^GATGCTAAAATAGGAGACAGCAAAGTA 
ACACTTGCTGCTGTAGGTGCTCTGCTATCCAGACAGCGATGGCACTCGCA 

8401 CACCAAGATGAGGGATGCTCCC^GCTGACGGATGCTGGGGCAGTAACAGT 
GGGTCCCATGCTGCCTGCTCATTA3CATCACCTCAGCCCTCACCAGCCCA 

• XCAGAAGGATCATCCCAAGCTGACXSAAAGTTGCrCATCTTCTTCACATCA 

8551 TCAAAOTTTTGGCXTCACTGATC^ 

GACAT CTTTATTTTT CTATGATTTCAAGT CAGAAC CTCCGGAT CAGGAGG 
GAACACAXAGTGGGAAXGTACCCTCAGCTCCAAGGCCAGAXCXXCCTXCA 

8701 aTGATCATGCATGCTACTTAGG--tGGTGTGTGTGTGTGAAXGTAGAATTG 
CCTTTGTTATTTTTTCTTCCTGCTGTC^GGAACATTTTGAATACCAGAGA 
AAAAGAAAAGTGCTCTTCTTGGC^TGGGAGGAGTTGTC^ 

8851 TAAAGGATGCAGTCCCAAATGTTCATAATCTCAGGGTCTGAAGGAGGATC 
AGAAACTGTGTATACAATTT CAGGCTT CT CTGAATGCAGCTTTTGAAAGC 
f TGTTCCTGGCCGAGGCAGTACTAGTCAGAACCCTCGGAAACAGGAACAAA 

9001 XGTCTTCAAGGTGCAGCAGGAGGAAACACCTTGCCCATCATGAAAGTGAA 
XAACCACTGCCGCTGAAGGAATCCAGCXCCTGTTTGAGCAGGXGCXGCAC 
ACTCCCACACTGAAACAAC^GTTC^TTTTTATAGGACTTCCAGGAAGGAT 

9151 CTTCTTCTTAAGCTTCTTAATTATGGTACATCTCGAGTTGGCAGATGACT 

"axgacxactgacaggagaatgaggaacxagcxgggaaxaxxtctgtttga 

CCAC CATGGAGTCACCCATTTCTTT ACTGGTATTTGGAAATAATAATTCT 
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93 01 GAATTGGAAAGC^GGAGTTAGCGAAGATCTTC ATTT CTTC CATGTTGGTG 

ACAGCACAGTTCTGGCTATGAAAGTCTGCTTACAAGGAAGAGGATAAAAA . 

9401 TCATAGGGATAATAAATCTAMSTTTGAAGACAATGAGGTTTTAGCTGCAT 
TTGACATGAAGAAATTGAGACttCTACTGGATAGCTATGGTATTTACGTG 
TCTTTTTGCTTAGTTACt TATTGACC CCAGCTGAGGT CAAGTATGAACTC 

9551 AGGTCTCTCGGGCTACTGGCATGGATTGATTACATACAACTGTAATTTTA 
GCAGTGATTTAGGGTTTATGAGTACTTTTGCAGTAAATCATAGGGTTAGT 
AATGTTAATCTCAGGGAAAAAAAAAAAAAGCCAACCCTGACAGACATCCC 

9701 AGCTC^GGTGGAAATCAAGGATCACAGCTCAGTGCGGTCCCAGAGAACAC 
AGGGACTCTTCTCTTAGGACCTTTATGTACAGGGCCTCAAGATAACTGAT 
GTTAGTCAGAAGACTTTCCATTCTGGCCACAGTTCAGCTGAGGCAATCCT 

9851 GGAATTTTCTCTCCGCTGCACAGTTCCAGTCATCCCAGTTTGTACAGTTC 
TGGCACTTTTTGGGTCAGGCCGTGAT CCAAGGAGCAGAAGTTCCAGCTAT 
' GGTC a .GOTAGTGCCTGACCGTCCCAACTCACTGCACTCAAACAAAGGCGA 

10 00 1 AACC^CAAGAGTGGCTTT^^ 

CACCAGTACTGGATTGACCACGAGGCAACATTAATCCTCAGCAAGTGCAA 

TTTGC^GCCATTAAATTGAACTAACTGATACTACAATGCAATCAGTATCA 

10151 ACAAGTGGTTTGGCTTGGAAGATGGAGTCTAGGGGCTCTACAGGAGTAGC 
TACTCTCTAATGGAGTTGCATXTTGAAGCAGGACACTGTGAAAAGCTGGC 
CTCCTAAAGAGGCTGCTAAACATTAGGGTCAATTTTCCAGTGCACTTTCT 

10301 GAAGTGTCTGCAGTTCCCCATGCAAAGCTGC C CAAACATAGCACTTCCAA 
TTGAATACAATTATATGCAGGCGTACTGCTTCTTGCCAGCACTGTCCTTC 
TCAAATGAACTCAAC a JLACAATTTCAAAGTCTAGTAGAAAGTAACAAGCT 

10451 TTGAA T GTCATTAAA^GTATATCTGCTTTCAGT AGTTCAGCTTATTTAT 
GCCCACTAGAAACATCTTGTACAAGCTGAACACTGGGGCTCCAGATTAGT 
GGTAAAACCTACTTTA.TACAATCATAGAATCATAGAATGGCCTGGGTTGG 

10601 AAGGGACCCCAAGGATCATGAAGATCCAACACCCCCGCCACAGGCAGGGC 
CACCAACCTCC^GATCTGGTACTAGACCAGGCAGCCCAGGGCTCCATCCA 
ACCTGGCC\TGAAC^CCTCC a .GGGATGGAGCATCCACAACCTCTCTGGGC 

10751 AGCCTGTGCCAGCACCTCACCACCCTCTCTGTGAAGAACTTTTCCCTGAC 
ATCCAATCTAAGCCTTCCCTCCTTGAGGTTAGATCCACTCCCCCTTGTGC 
XATCACTGTCTACTCTTGTAAAAAGTTGATTCTCCTCCTTTTTGGAAfiGT 

10901 TGCAATGAGGTCTCCTTGCAGCCTTCTTCTCTTCTGCAGGATGAACAAGC 
CCAGCTCCCTCAGCCTGTCTTTATAGGAGAGGTGCTCCAGCCCTCTGATC 
ATCTTTGTGGCCCTGCTCTCK5ACCCGCTCGAAGAGCTCCACATCTTTCCT 

11051 GTACTGGGGGC CCCA.GGCCTGAATGCAGTACTC CAGATGGGGCCTCAAAA 
- GAGC AGAGT AAAGAGGGACAATCACCTTC CTCAC C CTGCTGGCCAGCCCT 
CTTCTGATGGAGCCCTGGATACAACTGGCTTTCTGAGCTGCAACTTCrCC 

11201 TTATCAGTTCCACTATTAAAACAGGAACAATACAACAGGTGCTGATGGCC 
AGTGCAG ^GTTTTTCACACTT CTTCATTT CGGTAGAT CTTAGATGAGGAA 
CGTTGA^GTTGTGCTTCTGCGTGTGCTTCTTCCTCCTCAAATACTCCTGC 

113 51 CTGATACCTCACCCCACCTGCCACTGAATGGCTCCATGGCCCCCTGCAGC 
CAGGGCCCTGATGAACCCGGCACTGCTTCAGATGCTGTTTAATAGCACAG 
TATGACCAAGTTGCACCTATGAATACACAAACAATGTGTTGCATCCTTCA 

11501 GCACTTGAGAAGAAGAGCCAAATTTGCATTGTCAGGAAATGGTTTAGTAA 
TTCTGCCAATTAAAACTTGTTTATCTACCATGGCTGTTTTTATGGCTGTT 
AGTAGXGGTACACTGATGATGAACAATGGCTATGCAGTAAAATCAAGACT 
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13 651 GT^AT^TTGCAACAGACTATAAAATTCCTCTGTGGCTTAGCCAATCTGG 
" TAC^CCCACATTGTATAAGAAATTTGGCAAGTTTAGAGCAATGTTTGAA 
GTG^GGGAAAXrTCTGTATACTCAAGAGGGCGTTTTTGAC^ACTGTAGA 
1] 801 ACA^^GGAATQ^AAAGGGGGTGGGAGGAAGTTAAAAGAAGAGGCAGGTGG 

" aagXsagcttgcagtcccgctgtgtgtacgacactggcaacatgaggtct 

TTG^A^TCTTGGTGCTTTGCTTCCTGCCCGTGGCTGCCTTAGGGTGCGA 
H 931 TCTGCCTCAGACCCACAGCCTGGGCAGCAGGAGGACCCTGATGCTGCTGG 
"• CTCnGATGAGGAGAATCAGCCTGTTTAGCTGCCTGAAGGATAGGCACGAT 

TTTGGCTTTCCTCAAGAGGAGTTTGGCAACCAGTTTCAGAAGGCTGAGAC 
12101 CATCCCTGTGCTGCACGAGATGATCCAGCAGATCTTTAACCTGTTTAGGA 

CCA^GGATAGCAGCGCTGCTTGGGATGAGACCCTGCTGGATAAGTTTTAC 

ACCG^SCTGTACCAGCAGCTGAACGATCTGGAGGCTTGCGTGATCCAGGG 
12251 CGTG^GCGTGACCGAGACCCCTCTGATGAAGGAGGATAGCATCCTGGCTG 

TC-a^a^GTACrrTCAGAGGATCACCCTGTACCTGAAGGAGAAGAAGTAC 

AGCCCCTGCGCTTGGGAAGTCGTGAGGGCTGAGATCATGAGGAGCTTTAG 
12<*01 CCXGA.GCACCAACCTGCAAGAGAGCTTGAGGTCTAAGGAGTAAAAAGTCT 
12 "° AGaoTCGGGGCGGCGCGTGGTAGGTGGCGGGGGGTTCCCAGGAGAGCCCC 

CAG^GCGGACGGCAGCGCCGTCACTCACCGCTCCGTCTCCCTCCGCCCAG 
12551 GGTCGCCTGGCGCAACCGCTGCAAGGGCACCGACGTCCAGGCGTGGATCA 

GAG^CTGCCGGCTGTGAGGAGCTGCCGCGCCCGGCCCGCCCGCTGCACAG 
- rcr.^cC^TTTGCGAGCGCGACGCTACCCGCTTGGCAGTTTTAAACGCAT 
12701 CCC^aTT^AAACGACTATACGCAAACGCCTfCCCGTCGGTCCGCGTCTC 
12701 ^x^c^CG^CAGGGCGACACTCGCGGGGAGGGCGGGAAGGGGGCCGGGC 

GGGiG-CCGCGGCCAACCGTCGCCCCGTGACGGCACCGCCCCGCCCCCGT 
12351 GACG-GGTGCGGGCGCGGGGGCCGTGGGGCTGAGCGCTGCGGCGGGGCCG 

SccgSSgggcgggagctgagcgcggggccsgctgc^ 

CTCrGGTGCAATATGTTCAAGAGAATGGCTGAGTTCGGGCCTGACTCCGG 
13001 GGG^GGGTGAAGGTGCGGCGCGGGCGGAGGGACGGGGCGGGCGCGGGGC 
ScCCG^G^TGCCGGGGCCTCTGCCGGCCCGCCCGGCTC^ 
^ ^..-.-TTACGGGCGCGCTTCTCGCCGCTGCCGCTTCTCTTCTCTCCCGC 

13151 gS^-ggcgtcaccatcgtgaagccggtagtgtacgggaacgtggcgcgg 

13131 ^^^tcgG^AAgVaGAGGXjAGGAGGACGGGCACACGCATCAGTGGACGGT 
TT^GTGAAGCCCTACAGGAACGAGGTAGGGCCCGAGCGCGTCGGCCGCC 
13 301 G^-"TCGGAGCGCCGGAGCCGTCAGCGCCGCGCCTGGGTGCGCTGTGGGA 

GTT'" i SGCTGCACGAGAGCTACGGGAATCCTCTCCGAGGTGGGTGTTGCG 

13*51 tcgg^Sgtttgctccgctcggtcccgctgaggctcgtcgccctcatctt 
13451 ^EctgccgSgtcgttaccaaaccgccgtacgagatcaccgaaacg 
13551 g^^ggggcgaa^tgaaa.tcatcatcaagatatttttcattgatccaaa 

S^CCGTAAGTACGCTCAGCTTCTCGTAGTGCTTCCCCCGTCCTG 

G^GGCCCGGGGCTGGGCTGCT 
13701gEgGAGCTG^GAGCTCCCTTTCCCGGGACGTGTGCTCTGTGTTCGGTC 

13701 ^rr^GGcSxCGGG^GGGCTTTGGCTCCATTTGGCTTCTCrGGCGCTTA 

SgS'g^g^cg^g^tacgcctgaactacagctgtgagaaggccgt 
13851 £££SSSS^ 

CTC^CTGTCTTTGGAAATGCTTrAGAGAAGGTCTCTGTGGTAGTTCTTA 
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1*001 ATTTWAGCAGGCACGTTTAATAACGAATACTGAATTTAAGTAACTCTGC 

SSgtatgacgtttattttcgtattcctgaaagccattaaaatcct 

GTGCAGTTGTTTAGTAAGAACAGCTGCCACTGTTTTGTATCTAGGAGATA 
1 4151 ACTGGTGTTTCCCTACAGTTCTCAAGCTGATAAAACTCTGTCTTTGTATC 
;^ScCCTGTATCACTTGCTGAAGCTTTTTCAGTCTGACACCAATGC 
^TCCTGGGAAAGAAAACTGTAGTTTCTGAATTCTATGATGAAATGGTAT 
14301 GAAAATTTTAATGTCAACCGAGCCTGACTTTATTTAAAAAAAATTATTGA 

14301 ^Sgtgtattttggtccttccttagatatttcaagatcctactgcc 

ATGATGCAGCAACTGCTAACGACGTCCCGTCAGCTGACACTTGGTGCTTA 
CAAGCATGAAACAGAGTGTAAGTGCAAAATGAGGATACCTTCGCCGACCG 
ZZZ~^ ™* r-rn aT^TTTTrTGTGGGATGTGATCGTACAGTGAGTTTGG 



rAAGCATGAAAC^UA^l^ l^^^ivjv-rvfvru-vj. ^r™-w- — 

TCATTCACTACTAATGTTTTCTGTGGGATGTGATCGTACAGTGAGTTTGG 
CTGTGTGAAATTTGAATAGCTTGGTATTGGCAGTGATGACGTGATCGATG 
_1_ ^^wmiTCi aGTAGAATAAATGCAGCCTGCTTTAT 



144S1 




JStgaccgag^ 
14751 ^SScS^cacctagxctttaattcccagggttttgtttt 

ttc^ttggtcatagtttttgtttttcactctggcaaatgatgttgtgaat 

1 a 901 TACACTGCTTCAGCCACAAAACTGATGGACTGAATGAGGTCATCAAACAA 
14901 SSScSrrCCGTATTTCCTTTTTTTTCCCGCACTTATCATTTTTAC 
TGCTGTrGTTGAGTCTGTAAGGCrAAAAGTAACTGTTTTGTGCTTTTTCA 
15051 GGACGTGTGCTTTCCAAATTACTGCCACATATATAAAGAAAGGTTGGAAT 
TTTAA^G ATAATT CATGTTTCTTCTT CTTTTTTGC CAC CAC AGTTGCAGA 
^S^GTAAAAACCAGGGA^AAGCTGGAAGCTGCCAAAAAGA 
^TGAAATTGCTGAGCTTAAAGAAAGGTTAAAAGCAAGTCGTGAAACC 
, , „ ~»^a a arr ar,a AAACTCGAAGAGGATGATCAGTC 



15201 GrrTTGAAATTGCTGAGCTTAAAfcrtww* j..^™— 

ATCAACTGCTTAAAGAGTGAAATCAGAAAACTCGAAGAGGATGATCAGTC 

JaaagSSS^ 

i 5351 AAAGGACTTCAGTCGTGGAGTTGTATGCGTTCT 

agISgtatgaatttcatttgcaaatcactg^gtgtgtgaca^^ 

. t Z.™-^ annaTGaATGTATCCTCATTTTATAGTTA 



15501 



aa ATrTATGGGTATGTACTGGTTTATTTCAA^A^fvi ^ 
CTTCG^OTCGMATOGCTG^TTGTATTGACMCATTTGAGTGGTGTAGGA 
ACATTTTCTCTATGGTCCCGTGTTAGTTTACAGAATGCCACTGTTCACTG 
1 5651 TrtTGTTCTCTATTTOACTTTTTCTACTGCAACGTCAAGGTTTTAAAACT 
JcIIaSAAAACATGCAGGTrTTT 

i 5a oi SSSSSSw^ 

AAAAGAGAAAACTTCATAAAATCA 

TTA^CA^CAGATAAGAGTTTCTAGCCCTGCATCTACTTTCACCTATG^G 
15951 TOGATGCC^^ATATTTTGTGTGTITGGATGCAGGAAGTGATTCC^CTC 
1S9S1 to^ATGTAGATATTCTATTTAACACTTGTACTCTGCTGTGCTTAGCCTT 

16051 ^C^^^^^CC^^^^ 
ATACAGATGGCAGACCCTCAGGCTTATAAAGGCTTGGGCATCTTCTTTAC 

Tr r TTTG AGATT CTGTGT TG CAGTAACCT CTGCCAGAGAGGAGAAAA.GCC 

1S201 cScSacctcIScccttcttctatagcaat^ 

G^GAACAGAGCACTGGTTTGA^CGTTTGATAATTAGCATTTAACATGGC 
T^^TA^G^MG^AGAACTGAAA.CAGCTGTGACAGTATGAACTCAGTATG 
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16351 GAGACTT C ATT AAGAC AAACAGCTGTT AAAAT CAGGCATGTTT C ATTGAG 
GAGGACGGGGCAACTTGCACCAGTGGTGCCCACACAAATCCTTCCTGGCG 
CTGCAGACCAATTTTTCTGGCATTCTGACTGCCGTTGCTGCTGGTCAaiG 

16501 AGAGCAACTATTTTTATCAGCCACAGGCAATTTGCTTGTAGTATTTTCCA 
AGTGTTGTAGGTAAGTATAAATGCATCGGCTCCAGAGCACTTTGAGTATA 
CTTATTAA AAACATAAATGAAAGACAAATTAGCTTTGCTTGGGTGCACAG 

16651 AACATTTTTAGTTCCAGCCTGCTTTTTGGTAGAAGCCCTCTTCTGAGGCT 
AGAACTGACTTTGACAAGTAGAGAAACTGGCAACGGAGCTATTGCTATCG 

16751 AAGGATC CTTGTT AACAAAGTTAATCGTCTTTTAAGGTTTGGTTTATTCA 
TTAAATTTGCTTTTAAGCTGT AG CTGAAAAAGAACGTGCTGT CTT CCAIG 

16851 CACCAGGTGGCAGCTCTGTGCAAAGTGCTCTCTGGTCTCACCAGCCTTTT 
AATTGCCGGGATTCTGGCACGTCTGAGAGGGCTCAGAGTGGCTTCGTTTG 
TTTGAACAGCGTGTACTGCTTTCTGTAGACATGGCCGGTTTCTCTCCTGC 

17001 AGCTTATGAAACTGTTCACACTGAACACACTGGAACAGGTTGCCCAAGGA 
GGCCGTGGATGCCCCATCCCTGGAGGCATTCAAGGCCAGGCTGGATGTGG 
CTCTGGGCAGCCTGGTCTGGTGGTTGGCGATCCTGCACATAGCAGCGGGG 

17151 TTGAAACTCGATGATCACTGTGGTCCTTTTCAACCCAGGCTATTCTATGA 
TTCTATGATTCAACAGCAAATCATATGTACTGAGAGAGGAAACAAACaC2 
AGTGCTACTGTTTGCAAGTTTrGTTCATTTGGTAAAAGAGTCAGGTTTTA 

17301 AAATTCAAAATCTGTCTGGTTTTGGTGTTTTTTTTTTTTTATTTATTATT 
TCTTTGGGGTTCTTTTTGATGCTTTATCTTTCTCTGCCAGGACTGTGTGA 
CAATGGGAACGAAAAAGAACATGCCAGGCACTGTCCTGGATTGCACACGC 

17451 TGGTTGCACTCAGTAGCAGGCTCAGAACTGCCAGTCTTTCCACAGTAi iA 
CTTTCTAAACCTAATTTTAATAGCGTTAGTAGACTTCCATCACTGGGCAG 
' TGCTTAGTGAATGCTCTGTGTGAACGTTTTACTTATAAGCATGTTGGAA3 

17601 TTTTGATGTTCCTGGATGCAGTAGGGAAGGACAGATTAGCTATGTGAA-A 
GTAGATTCTGAGTATCGGGGTTACAAAAAGTATAGAAACGATGAGA?ATT 
CTTGTTGTAACTAATTGGAATTTCTTTAAGCGTTCACTTATGCTAC-.TTC 

17751 ATAGTATTTCCATTTAAAAGTAGGAAAAGGTAAAACGTGAAATCGTGTGA 
TTTTCGG^TGGAACACCGCCTTCCTATGCACCTGACCAACTTCCAGAGGA 
AAAGCCTATTGAAAGCCGAGATTAAGCCACCAAAAGAACTCATTTGCATT 

17901 GGAATATGTAGTATTTGCCCTCTTCCTCCCGGGTAATTACTATACTTTAT 
AGGGTGCTTATATGTTAAATGAGTGGCTGGCACTTTTTATTCTCACAGCT 
GTGGGGAATTCTGTCCTCTAGGACAGAAACAATTTTAATCTGTTCCACTG 

' 18051 GTGACTGCTTTGTCAGCACTTGCACCTGAAGAGATCAATACACTCTTCAA 
TGTCTAGTTCTGCAACACTTGGCAAACCTCACATCTTATTTCATACTCTC 
TTCATGCCTATGCTTATTAAAGCAATAATCTGGGTAATTTTTGTTTTAA.T 

18201 CACTGTCCTGACCCCAGTGATGACCGTGTCCCACCTAAAGCTCAATTCAG 
GTCCTGAATCTCTTGAACTCTCTATAGCTAACATGAAGAATCTTCAPAAG 
TTAGGTCTGAGGGACTTAAGGCTAACTGTAGATGTTGTTGCCTGGTTTCT 
18351 GTGCTGAAGGCCGTGTAGTAGTTAGAGCATTCAACCTCTAGAAGAAGCTT 
GGCCAGCTGGTCGACCTGCAGATCCGGCCCTCGAGGGGGGGCCCGGTACC 
CAGCTTTTGTTCCCTTTAGTGAGGGTTAATTTCGAGCTTGGCGTAATCAT 
18501 GGTCATAGCTGTTTCCTGTGTGAAATTGTTATCCGCTCACAATTCCACAC 
AACATACGA.GCCGGAAGCATAAAGTGTAAAGCCTGGGGTGCCTAATGAGT 
GAGCrAACTCACATTAATTGCGTTGCGCTCACTGCCCGCTTTCCAGTCGG 

18651 gaaacctgtcgtgccagctgcattaAtgaatcggccaacgcgcggggaga 
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GGCGGTTTGCGTATTGGGCGCTCTTCCGCTTCCTCC-CrC^TGACTCGCT 
GCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAG: . --.CTCAAAGGCGG 

18801 ^^^^^^^^^^^ 
GC aj^GGCCAGCAAAAGGCCAGGAACCGTAAAAAC--- - -GCGTTGCTGGC 

g5otccataggctccgcccccctgacc^gcatc?.^aaatcgacgct 

18951 C^GTCAGAGGTGGCGAAACCCGACAGGACTATAAASATACCAGGCGTTT 
rcCCCTGGAAGCTCCCTCGTGCGCTCTCCTGCT 



19101 




GCTCACGCTGTACitiiAX <~ 1 Utui J. w w , , 

GGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCT3CGCCTTATCCGG 

ta^ctatStcttg^gtccaacccggtaagagacg^cttatcgccactgg 

19251 CAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGG'TATGTAGGCGGTGCT 
A^S^GTTCCTGAAGTGGTGGCCTAACTACGGCTACACtAGAAGGACAGT 

A^TATCTGCGCTC^^ 
n Qini rviGCTCTTGATCCGGCAAACAAACCACCGCTGGT.-.---C^-TGGTTTTTTT 

1940 g^tccaa^ 

wtg^TCTTTTCTACGGGGTCTGACGCTCAGTGGAACSAAAACTCACGTT 

is55i SSSSSk.^^ 

> TT aaaTTAAAAATG AAGTTTT AAATCAAT CT AAAG . .-. - « A ATGAGTAAAC 
TOGOTCTG^CAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGA 

19701 S£S5E^«^^ 

ACTACGATACGGGAGGGCTTACCATCTGGCCCCAG - 

19851 CCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACi ~ ^lES^S^^jj^ 
CAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAC- 

TAGTTTGCGCAACGTTGTTGCC^^ 




GT?ACATGATCCCU^i^ii.^^^--~^~-"_;_„ 
TCCGATCGTTGTCAGAAGTAAGTTGGCCGCMTGi irt-^CTCATGGTTA 

20151 TGGCAGCACTGCATAATTCTCTTACTGTCATGGO-.i ^^t^^j^-jQCQ 
. TCTGTGACTGGTGAGTACTCAACC^GTCA^ 
GCGACCGAGTTGCTCTTGCCCGGCGTC^ 

2 045iSxgaScaaa^caggaaggcaaa^ 

ACACGGAAATGTTGAATACTCATACTGTTCCTTT^-"^^^^^^^^ 
C&TTTATCAGGGTTATTGTCTCATGAGCGGATACAiA. iTGAATGTATTT 

CCTAAATTGTAAGCGTTAATArTTTGCTAAAA^C^ : -^TTT^T 



2075 




ACCGTCTATCfiGGGCGRTGGCCCACTACGTGMCCATCACCCTAATCTM 
- nqn1 • TT TTTTGGGGTCGAGGTGCCGTAAAGCACTAAATCu-- W CCCTAAAGeeA 

20901 ^^^^^^^^^^^^ 
GAAGGGAAGAAAGCGAAAGGAGCGGGCGCTAGGGCG-w-GCAAGTGTAGC 
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21051 GGT CACGCTGCGCGTAACCACCACACCCGCCGCGCTTAATGCGC CGCTAC 
AGGGCGCGTCCCATTCGCCATTCAC3GCTGCGCAACTGTTGGGAAGGGCGA 
TCGGTGCGGGCCTCTTCGCTATTACGCC^GCTGGCGAAAGGGGGATGTGC 
- 21201 TGCAAGGCGATTAAGTTGGGTAACGCC^GGGTTTTCCCAGTCACGACGTT 
GTAAAACGACGGCCAGTGAATTGTAATACGACTCACTATAGGGCGAATTG 

21301 GAGCTCCACCGCGGTGGCGGCCGCTCTAG 
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SEQ ID NO. 14 I 

ATGGCTTTGA CCTTTGCCTT ACTGGTGGCT CTCCTGGTGC TGAGCTGCAA GAGCAGCTGC 
TCTGTGGGCT GCGATCTGCC TCA 

SEQ ID NO. 15 

GACCCACAGC CTGGGCAGCA GGAGGACCCT GATGCTGCTG GCTCAGATGA GGAGAATCAG
CCTGTTTAGC TGCCTGAAGG ATAGGCACGA TTTTGGCTTT 

SEQ ID NO. 16 

CTCAAGAGGA GTTTGGCAAC CAGTTTCAGA AGGCTGAGAC CATCCCTGTG GTGCACGAGA 
TG 

SEQ ID NO. 17 

TCCAGCAGAT CTTTAACCTG TTTAGCACCA AGGATAGCAG CGCTGCTTGG GATGAGACCC 
TGCTGGATAA GTTTTACACC GAGCTGTACC AGCA 

SEQ ID NO. 18 

' SSScGATC TGGAGGCTTG CGTGATCCAG GGCGTGGGCG TGACCGAGAC CCGTCTGATG 
AAGGAGGATA GCATCCT 

SEQ ID NO. 19 

gSgagga agtactttca gaggatcacc ctgtacctga aggagaagaa gtacagccct 

TGCGCTTGGG AAGTCGTGAG GG 
SEQ ID NO. 20 

CTGAGATCAT GAGGAGCTTT AGCCTGAGCA CCAACCTGCA AGAGAGCTTG AGGTCTAAGG
AGTAA

SEQ ID NO. 21 

IFN-1 _ mitl 

CCCAAGCTTT CACCATGGCT TTGACCTTTG CCTT 

SEQ ID NO. 22
IFN-2b

ATCTGCCTCA GACCCACAG
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SEQ ID NO. 23 
IFN-3C 

GATTTTGGCT TTCCTCAAGA GGAGTT 

SEQ ID NO. 24 
IFN-4b

GCACGAGATG ATCCAGCAGA T 

SEQ ID NO. 25 
IFN-5

ATCGTTCAGC TGCTGGTACA 

SEQ ID NO. 26 
IFN-6

CCTCACAGCC AGGATGCTAT 

SEQ ID NO. 27 
IFN-7 

ATGATCTCAG CCCTCACGAC 

SEQ ID NO. 28
IFN-2 

CTGTGGGTCT GAGGCAGAT 

SEQ ID NO. 29 
IFN-3b 

AACTCCTCTT GAGGAAAGCC AAAATC 

SEQ ID NO. 30 
IFN-4 

ATCTGCTGGA TCATCTCGTG C

SEQ ID NO. 31 
IFN-8 

TGCTCTAGAC TTTTTACTCC TTAGACCTCA AGCTCT 
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SEQ ID NO. 32 

Oligo 1. TCACTCGAGG TGAATATCCA AGAAT 
SEQ ID NO. 33 

Oligo 2. GAGATCGATT TTGGCTGGAC ACTTG 
SEQ ID NO. 34 

Oligo 3. CACATCGATG TCACAACTTG GGAAT 
SEQ ID NO. 35 

Oligo 4. TCTAAGCTTC GTCACAGACC GTCCC 
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SEQUENCE LISTING 



<110> AviGenics, Inc.
<120> Production of Transgenic Avians Using Sperm-mediated Transfection
<130> 11106-021-228
<140> To be assigned
<141> 2002-09-18
<150> 60/324,001
<151> 2001-09-21
<150> 60/323,961
<151> 2001-09-21
<160> 35
<170> PatentIn version 3.1


<210> 1
<211> 20
<212> DNA
<213> Artificial sequence
<220>
<223> Primer 5pLMAR2
<400> 1
tgccgccttc tttgatattc  20

<210> 2
<211> 20
<212> DNA
<213> Artificial sequence
<220>
<223> Primer LE-6.1kbrevl
<400> 2
ttggtggtaa ggcctttttg  20

<210> 3
<211> 20
<212> DNA
<213> Artificial sequence
<220>
<223> Primer lys-6.1
<400> 3
ctggcaagct gtcaaaaaca
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20 



<210> 4
<211> 20
<212> DNA
<213> Artificial sequence
<220>
<223> Primer LyaElrev
<400> 4
cagctcacat cgtccaaaga

<210> 5
<211> 498
<212> DNA
<213> Artificial sequence
<220>
<223> IFNMAGMAX
<220>
<221> misc_feature
<222> (1)..(498)
<223>
<400> 5
tgcgatctgc ctcagaccca cagcctgggc agcaggagga ccctgatgct gctggctcag  60
atgaggagaa tcagcctgtt tagctgcctg aaggataggc acgattttgg ctttcctcaa  120
gaggagtttg gcaaccagtt tcagaaggct gagaccatcc ctgtgctgca cgagatgatc  180
cagcagatct ttaacctgtt tagcaccaag gatagcagcg ctgcttggga tgagaccctg  240
ctggataagt tttacaccga gctgtaccag cagctgaacg atctggaggc ttgcgtgatc  300
cagggcgtgg gcgtgaccga gacccctctg atgaaggagg atagcatcct ggctgtgagg  360
aagtactttc agaggatcac cctgtacctg aaggagaaga agtacagccc ctgcgcttgg  420
gaagtcgtga gggctgagat catgaggagc tttagcctga gcaccaacct gcaagagagc  480
ttgaggtcta aggagtaa  498

<210> 6
<211> 12728
<212> DNA
<213> Gallus gallus

<220>
<221> misc_feature
<222> (1)..(237)
<223> 5prime matrix (scaffold) attachment region (MAR)

<220>
<221> misc_feature
<222> (261)..(1564)
<223> 5prime matrix (scaffold) attachment region (MAR)

<220>
<221> misc_feature
<222> (1565)..(1912)
<223> 5prime matrix (scaffold) attachment region (MAR)

<220>
<221> misc_feature
<222> (1930)..(2012)
<223> 5prime matrix (scaffold) attachment region (MAR)

<220>
<221> misc_feature
<222> (2013)..(2671)
<223> Intrinsically curved DNA

<220>
<221> misc_feature
<222> (5848)..(5934)
<223> Transcription Enhancer

<220>
<221> misc_feature
<222> (9160)..(9325)
<223> Transcription Enhancer

<220>
<221> misc_feature
<222> (9326)..(9626)
<223> Negative Regulatory Element

<220>
<221> misc_feature
<222> (9621)..(9660)
<223> Hormone Response Element

<220>
<221> misc_feature
<222> (9680)..(10060)
<223> Hormone Response Element

<220>
<221> misc_feature
<222> (10576)..(10821)
<223> Chicken CR1 Repeat Sequence

<220>
<221> misc_feature
<222> (10926)..(11193)
<223> Chicken CR1 Repeat Sequence

<220>
<221> misc_feature
<222> (11424)..(11938)
<223> Lysozyme Proximal Promoter and Lysozyme Signal Peptide

<220>
<221> misc_feature
<222> (11946)..(12443)
<223> Human Interferon alpha 2d encoding region codon optimized for expression in chicken cells (IFNMAGMAX)

<220>
<221> polyA_signal
<222> (12444)..(12728)
<223>



<400> 6 
tgccgccttc 


tirxgatatxc acrctgui-gu 




ttcttcrccaa 


taaaacroata 

r 


60 


taacagtctg 


tataacagtc tgtgaggaaa 


tacttggtat 


ttcttctgat 


cagtgttttt 


120 


ataagtaatg 


ttgaatattg gataaggctg 


tgtgtccttt 


gtcttgggag 


acaaagccca 


180 


cagcaggtgg 


tggttggggt ggtggcagct 


cagtgacagg 


agaggttttt 


ttgcctgttt 


240 


tttttttttt 


tttttttttt aagtaaggtg 


ttcttttttc 


ttagtaaatt 


ttctactgga 


300 


ctgtatgttt 


tgacaggtca gaaacatttc 


ttcaaaagaa 


gaaccttttg 


gaaactgtac 


360 


agcccttttc 


tttcattccc tttttgcttt. 


ctgtgccaat 


gcctttggtt 


ctgattgcat 


420 


tatggaaaac 


gttgatcgga acttgaggtt 


tttatttata 


gtgtggcttg 


aaagcttgga 


480 


tagctgttgt 


tacacgagat accttattaa 


gtttaggcca 


gcttgatgct 


ttattttttc 


540 


cctttgaagt 


agtgagcgtt ctctggtttt 


tttcctttga 


aactggtgag 


gcttagattt 


600 


ttctaatggg 


attttttacc tgatgatcta 


gttgcatacc 


caaatgcttg 


taaatgtttt 


660 


cctagttaac 


atgttgataa cttcggattt 


acatgttgta 


tatacttgtc 


atctgtgttt 


720 


ctagtaaaaa 


tatatggcat ttatagaaat 


acgtaattcc 


tgatttcctt 


tttttttatc 


780- 


tctatgctct 


gtgtgtacag gtcaaacaga 


cttcactcct 


atttttattt 


atagaatttt 


840 


atatgcagtc 


tgtcgttggt tcttgtgttg 


taaggataca 


gccttaaatt 


tcctagagcg 


900 


atgctcagta 


aggcgggttg tcacatgggt 


tcaaatgtaa 


aacgggcacg 


tttggctgct 


960 


gccttcccga 


gatccaggac actaaactgc 


ttctgcactg 


aggtataaat 


cgcttcagat 


1020 
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cccagggaag 


tgcagatcca 


cgtgcatatt 


cttaaagaag 


aatgaatact 


ttctaaaata 


1080 


ttttggcata 


ggaagcaagc 


tgcatggatt 


tgtttgggac 


ttaaattatt 


ttggtaacgg 


1140 


agtgcatagg 


ttttaaacac 


agttgcagca 


tgctaacgag 


tcacagcgtt 


tatgcagaag 


1200 


tgatgcctgg 


atgcctgttg 


cagctgttta 


cggcactgcc 


ttgcagtgag 


cattgcagat 


1260 


aggggtgggg 


tgctttgtgt 


cgtgttccca 


cacgctgcca 


cacagccacc 


tcccggaaca 


1320 


catctcacct 


gctgggtact 


tttcaaacca 


tcttagcagt 


agtagatgag 


ttactatgaa 


. 1380 


acagagaagt 


tcctcagttg 


gatattctca 


tgggatgtct 


tttttcccat 


gttgggcaaa 


1440 


gtatgataaa 


gcatctctat 


ttgtaaatta 


tgcacttgtt 


agttcctgaa- 


tcctttctat 


1500 


agcaccactt 


attgcagcag 


gtgtaggctc 


tggtgtggcc 


tgtgtctgtg 


cttcaatctt 


1560 


ttaaagcttc 


tttggaaata 


cactgacttg 


attgaagtct 


cttgaagata 


gtaaacagta 


1620 


cttacctttg 


atcccaatga 


aatcgagcat 


ttcagttgta 


aaagaattcc 


gcctattcat 


1680 


accatgtaat 


gtaattttac 


acccccagtg 


ctgacacttt 


ggaatatatt 


caagtaatag 


1740 


actttggcct 


caccctcttg 


tgtactgtat 


tttgtaatag 


aaaatatttt 


aaactgtgca 


1800 


tatgattatt 


acattatgaa 


agagacattc 


tgctgatctt 


caaatgtaag 


aaaatgagga 


1860 


gtgcgtgtgc 


ttttataaat 


acaagtgatt 


gcaaattagt 


gcaggtgtcc 


ttaaaaaaaa 


1920 


aaaaaaaaag 


taatataaaa 


aggaccaggt 


gttttacaag 


tgaaatacat 


tcctatttgg 


1980 


taaacagtta 


catttttatg 


aagattacca 


gcgctgctga 


ctttctaaac 


ataaggctgt 


2040 


attgtcttcc 


tgtaccattg 


catttcct ca 


ttcccaattt 


gcacaaggat 


gtctgggtaa 


2100 


actattcaag 


aaatggcttt 


gaaatacagc 


atgggagctt 


gtctgagttg 


gaatgcagag 


2160 


ttgcactgca 


aaatgtcagg 


aaatggatgt 


ctctcagaat 


gcccaactcc 


aaaggatttt 


2220 


atatgtgtat 


atagtaagca 


gtttcctgat 


tccagcaggc 


caaagagtct 


gctgaatgtt 


2280 


gtgttgccgg 


agacctgtat 


ttctcaacaa 


ggtaagatgg 


tatcctagca 


actgcggatt 


2340 


ttaatacatt 


ttcagcagaa 


gtacttagtt 


aatctctacc 


tttagggatc 


gtttcatcat 


2400 


ttttagatgt 


tatacttgaa 


atactgcata 


acttttagct 


ttcatgggtt


cctttttttc 


2460 


agcctttagg 


agactgttaa 


gcaatttgct 


gtccaacttt 


tgtgttggtc 


ttaaactgca 


2520 


atagtagttt 


accttgtatt 


gaagaaataa 


agaccatttt 


tatattaaaa 


aatacttttg 


2580 


tctgtcttca 


ttttgacttg 


tctgatatcc 


ttgcagtgcc 


cattatgtca 


gttctgtcag 


2640 


atattcagac 


atcaaaactt 


aacgtgagct 


cagtggagtt 


acagctgcgg 


ttttgatgct 


2700 


gttattattt 


ctgaaactag 


aaatgatgtt 


gtcttcatct 


gctcatcaaa 


cacttcatgc 


2760 
agagtgtaag
tatcagattt 
tgaaatgcag 
tttggaatga 
gagaaagtga 
cccgattcct 
ggccactggt 
ttgagcatgg 
gtgctgggct 
cagggaaaag 
acgtagggtg 
acagtggaag 
aagcggtatc 
gcagtctggg 
taggagaact 
ttctgcagca 
gtccaagctt 
aactgatgtc 
agagagctaa 
aggcaaaacc 
aagcccccag 
gcctagggct 
ttgagattta 
agcctgtact 
aagttgcaag 
gaaactactg 
gttacatgtc 
cgcatttgtc 
tcagaagaaa 



gctagtgaga 
ttttttcatt 
tctgattggc 
aggaagttaa 
acctggattt 
tgaaagggct 
tatttactgc 
actatagcct 
gtggctgggg 
tgtgggtaac 
tgtactctcg 
cattcaaggg 
agaagagcga 
aaagtagcac 
ttcttgctga 
cctgcaaggc 
cagcaggtca 
gaagcctcct 
ctctatgcca 
ggctgcccca 
gcagtgtgac 
ctgcccgcga 
gacacaaggg 
tcaaatatat 
agattgaagg 
cttctaaaca 
tgatgcactt 
acttatccca 
cagatgtgat 



aatgcataca 
tggaaatata 
atgaagaagc 
gcaagggcac 
ctttggctag 
ccagctttaa 
attatgtctc 
ggcttcagag 
ggactgtggg 
tatttttaag 
aagattaaca 
tagatcatct 
ggaaggtaag 
cccttgagca 
attctacttg 
ccagagcctg 
ttgtctttgc 
gtccactacc 
tagtctgaag 
tgagaagaaa 
aggcccctcc 
agtgcgtgtt 
aagcctgaaa 
attttgtgag 
ctgagtagtt 
cttgtttgag 
gcttgtcctt 
tatctgtcat 
aatccccagc 



tttattgata 
ttgttttcta 
acagcactct 
aggtccatga 
tgttctaaat 
tgcttccaaa 
agtttcgcag 
gccaggtgaa 
gactccaagc 
tactgtgttg 
gtgtgggttc 
aacgacacca 
cagtcttcat 
gagacaagga 
caagagcttt 
tggtgagctg 
ttcttccccc 
tgttgctgca 
gtaaaatggg 
gcagtggtaa 
tgccacctag 
tctttggtgg 
ggaggtgttg 
ggagtgtagc 
gagagggtaa 
tggtgagacc 
ttccatccac 
atctgacata 
cgccccaagt 



cttttttaaa 
gactgcatag 
tcatcttact 
aatagagaca 
ctgtagtgag 
ttgaaggtgg 
ctaacctggc 
ggttgggatg 
tgagcttggg 
caaacgtctc 
agtaatatat 
gatcatcaag 
atgttttccc 
aataattcag 
gatgcctggc 
gagggaaaga 
agcactgtgc 
ggcagactgc 
ttttaaaaaa 
acatggtaga 
aggcgggaac 
gttttgtttg 
ggcactattt 
gaattggcca 
cacgtttaat 
ttggataggt 
atccatgcat 
cctgtctctt 
ttgagaagat 



gtcaactttt 
cttctgaatc 
taaacttcat 
gtgcgctcag 
gaaagtaaca 
caggcaactt 
ttctccacta 
ggtggaagga 
gtgggcagca 
atctgcaaat 
ggatgaattc 
ctatgattgg 
tccacgtaaa 
gagcatgtgc 
ttctggtgcc 
ttctgctcaa 
agcagagtgg 
tctcagaaaa 
gaaaacacaa 
aaaggtgcag 
aagcttccct 
gcgtttggtt 
tggtttgtaa 
atttaaaata 
gagatcttct 
gagtgctctt 
tccacatcca 
cgtcacttgg 
ggcagttgct 



2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
tctttccctt


tttcctgcta 


agtaaggatt ttctcctggc 


tttgacacct 


cacgaaatag 


4560 


tcttcctgcc 


ttacattctg 


ggcattattt caaatatctt 


tggagtgcgc 


tgctctcaag 


4620 


tttgtgtctt 


cctactctta 


gagtgaatgc tcttagagtg 


aaagagaagg 


aagagaagat 


4680 


gttggccgca 


gttctctgat 


gaacacacct ctgaataatg 


gccaaaggtg 


ggtgggtttc 


4740 


tctgaggaac 


gggcagcgtt 


tgcctctgaa agcaaggagc 


tctgcggagt 


tgcagttatt 


4800 


ttgcaactga 


tggtggaact 


ggtgcttaaa gcagattccc 


taggttccct 


gctacttctt 


4860 


ttccttcttg 


gcagtcagtt 


tatttctgac agacaaacag 


ccacccccac 


tgcaggctta 


4920 


gaaagtatgt 


ggctctgcct 


gggtgtgtta cagctctgcc 


ctggtgaaag 


gggattaaaa 


4980 


cgggcaccat 


tcatcccaaa 


caggatcctc attcatggat 


caagctgtaa 


ggaacttggg 


5040


ctccaacctc 


aaaacattaa 


ttggagtacg aatgtaatta 


aaactgcatt 


ctcgcattcc


5100 


taagtcattt 


agtctggact 


ctgcagcatg 


taggtcggca 


gctcccactt 


tctcaaagac 


5160 


cactgatgga 


ggagtagtaa 


aaatggagac 


cgattcagaa 


caaccaacgg 


agtgttgccg 


5220 


aagaaactga 


tggaaataat 


gcatgaattg 


tgtggtggac 


atttttttta 


aatacataaa 


5280 


ctacttcaaa 


tgaggtcgga gaaggtcagt 


gttttattag 


cagccataaa 


accaggtgag 


5340 


cgagtaccat 


ttttctctac 


aagaaaaacg 


attctgagct 


ctgcgtaagt 


ataagttctc 


5400 


catagcggct 


gaagctcccc 


cctggctgcc 


tgccatctca 


gctggagtgc 


agtgccattt 


5460 


ccttggggtt 


tctctcacag 


cagtaatggg 


acaatacttc 


acaaaaattc 


tttcttttcc 


5520 


tgtcatgtgg 


gatccctact 


gtgccctcct 


ggttttacgt 


taccccctga 


ctgttccatt 


5580 


cagcggtttg 


gaaagagaaa aagaatttgg 


aaataaaaca 


tgtctacgtt 


atcacctcct 


5640 


ccagcatttt 


ggtttttaat 


tatgtcaata 


actggcttag 


atttggaaat 


gagagggggt 


5700 


tgggtgtatt 


accgaggaac 


aaaggaaggc 


ttatataaac 


tcaagtcttt 


tatttagaga 


5760 


actggcaagc 


tgtcaaaaac aaaaaggcct 


taccaccaaa 


ttaagtgaat 


agccgctata 


5820 


gccagcaggg 


ccagcacgag 


ggatggtgca 


ctgctggcac 


tatgccacgg 


cctgcttgtg 


5880 


actctgagag 


caactgcttt ggaaatgaca 


gcacttggtg 


caatttcctt 


tgtttcagaa 


5940 


tgcgtagagc 


gtgtgcttgg 


cgacagtttt 


tctagttagg 


ccacttcttt 


tttccttctc 


6000 


tcctcattct 


cctaagcatg 


tctccatgct 


ggtaatccca 


gtcaagtgaa 


cgttcaaaca 


6060 


atgaatccat 


cactgtagga 


ttctcgtggt 


gatcaaatct 


ttgtgtgagg 


tctataaaat 


6120 


atggaagctt 


atttattttt 


cgttcttcca 


tatcagtctt 


ctctatgaca 


attcacatcc 


6180 


accacagcaa 


attaaaggtg aaggaggctg 


gtgggatgaa 


gagggtcttc 


tagctttacg 


6240 
ttcttccttg
agttcagtct 
aggaccaaat 
cactatttca 
acattacata 
ctgtgtttaa 
aggggcctta 
ctcaggctgc 
acagcttctc 
ttaacatcta 
ctatctgtcc 
ggctgcagtg 
cagcctgtct 
cacggctttc 
gcagagcaga 
cccagggtac 
catccaccag 
tcaggagact 
atatacattt 
tacatgcaga 
caatttgctg 
ttaccttttg 
ctctgctctg 
ttgtcctcca
cttcagcagc 
ttttcagcag 
cagttcttct 
gcattttatt 
ctgggatttc 



caaggccaca 

cctgctggga 

agggtctatc 

ctgctcccac 

aatttgacct 

ccccttaagg 

aacatcatcc 

ccagggcccc 

tgggcagcct 

atctaaatct 

aagaaatgtg 

aggtctcccc 

tcgtaggaga 

ttgtggagcc 

tggggacaat 

tgttggcctt 

aacccacgct 

tccattcttt 

cagttcatgt 

attcctagtg 

caagtacctt 

gggtaagctt 

ttctgactgc 

tcctttccca 

catttaattc 

tcttgcaaag 

tgtttgaggt 

acttctatta 

cacagtgtct 



ggaaaatgct 

cagctaaccg 

tggggttttt 

ggttacaaac 

ggtaccaata 

cattcagaac 

atttccaacc 

atccagcctg 

gtgccaacac 

cttctctttt 

tattggtctc 

acagccttct 

tcatcttagt 

ccaggtctgg 

cgcttacccc 

tcaggctccc 

tcctggttaa 

aggacagact 

ttcctgtaac 

ccatctcagt 

ccaagctgcg 

ttgtatctgc 

accattttct 

gcttgtatct 

ttcagtgtca 

aacatctagc 

gagccataaa 

tgtacttact 

ctgtgtcctt 



gagagctgta 

catcttataa 

gttcctgctg 

caaagataca 

ttgttctcta 

aactagaatc 

ctctgccatg 

gccttgagca 

ctcaccactc 

agtttaaagc 

cctcctgctt 

cttctccagg 

ggccctcctc 

atgcagtact 

tccctgctgg 

agaccccttg 

tacttctgcc 

gtgttacacc 

aggacagaat 

agggttttca 

gcctcccata 

agagaccctg 

agatcaccca 

ttgacaaata 

tcttgttctg 

tgaaaacttt 

ttactagaac 

ttgacataac 

cacatggttt 



gaatacagcc 

ccccttctga 

ttcctcctgg 

gcctgaattt 

tatagttatt 

atagaatggt 

ggctgcttgc 

cctccaggga 

tctgggtaaa 

cattcctctt 

ataagcagga 

ctgaacaagc 

tggacccatt 

tcagatgggg 

ctgcccctgt 

ctgatttgtg 

ctcacttctg 

tacctgccct 

atgtattcct 

tggcagtatt 

aatcctgtat 

ggggttctga 

gttgttcctg 

caggcctatt 

ttgatgccac 

ctgccattca 

ttcgtcactg 

acagacacgc 

tactgtcata 



tggggtaaga 

gactcatctt 

aaggctatct 

tttctaggcc 

tccttcccca 

ttggattgga

cacccactgg 

tggggcaccc 

gaattctctt

tttcccgttg 

agtactggaa 

ccagctcctt 

ccaacagttc 

ccttacaaag 

tttgatgcag 

tcaagctttt 

taagcttgtt 

attcttgcat 

ctaacaaaaa 

agcacatagt 

ttgggatcag 

tgtgcttcag 

tacaacttcc 

tttgtgtttg 

tggaacagga 

atattcttac 

acaagtttat 

acatattttg 

cttccgttat 



6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
aaccttggca atctgcccag ctgcccatca caagaaaaga gattcctttt ttattacttc 8040

tcttcagcca ataaacaaaa tgtgagaagc ccaaacaaga acttgtgggg caggctgcca 8100 

tcaagggaga gacagctgaa gggttgtgta gctcaataga attaagaaat aataaagctg 8160 

tgtcagacag ttttgcctga tttatacagg cacgccccaa gccagagagg ctgtctgcca 8220 

aggccacctt gcagtccttg gtttgtaaga taagtcatag gtaacttttc tggtgaattg 8280 

cgtggagaat catgatggca gttcttgctg tttactatgg taagatgcta aaataggaga 8340 

cagcaaagta acacttgctg ctgtaggtgc tctgctatcc agacagcgat ggcactcgca 8400 

caccaagatg agggatgctc ccagctgacg gatgctgggg cagtaacagt gggtcccatg 8460 

ctgcctgctc attagcatca cctcagccct caccagccca tcagaaggat catcccaagc 8520 

tgaggaaagt tgctcatctt cttcacatca tcaaaccttt ggcctgactg atgcctcccg 8580 

gatgcttaaa tgtggtcact gacatcttta tttttctatg atttcaagtc agaacctccg 8640 

gatcaggagg gaacacatag tgggaatgta ccctcagctc caaggccaga tcttccttca 8700 

atgatcatgc atgctactta ggaaggtgtg tgtgtgtgaa tgtagaattg cctttgttat 8760

tttttcttcc tgctgtcagg aacattttga ataccagaga aaaagaaaag tgctcttctt 8820 

ggcatgggag gagttgtcac acttgcaaaa taaaggatgc agtcccaaat gttcataatc 8880 

tcagggtctg aaggaggatc agaaactgtg tatacaattt caggcttctc tgaatgcagc 8940 

ttttgaaagc tgttcctggc cgaggcagta ctagtcagaa ccctcggaaa caggaacaaa 9000 

tgtcttcaag gtgcagcagg aggaaacacc ttgcccatca tgaaagtgaa taaccactgc 9060 

cgctgaagga atccagctcc tgtttgagca ggtgctgcac actcccacac tgaaacaaca 9120 

gttcattttt ataggacttc caggaaggat cttcttctta agcttcttaa ttatggtaca 9180 

tctccagttg gcagatgact atgactactg acaggagaat gaggaactag ctgggaatat 9240 

ttctgtttga ccaccatgga gtcacccatt tctttactgg tatttggaaa taataattct 9300 

gaattgcaaa gcaggagtta gcgaagatct tcatttcttc catgttggtg acagcacagt 9360 

tctggctatg aaagtctgct tacaaggaag aggataaaaa tcatagggat aataaatcta 9420 

agtttgaaga caatgaggtt ttagctgcat ttgacatgaa gaaattgaga cctctactgg 9480 

atagctatgg tatttacgtg tctttttgct tagttactta ttgaccccag ctgaggtcaa 9540 

gtatgaactc aggtctctcg ggctactggc atggattgat tacatacaac tgtaatttta 9600 

gcagtgattt agggtttatg agtacttttg cagtaaatca tagggttagt aatgttaatc 9660 

tcagggaaaa aaaaaaaaag ccaaccctga cagacatccc agctcaggtg gaaatcaagg 9720 
atcacagctc agtgcggtcc cagagaacac agggactctt ctcttaggac ctttatgtac 9780

agggcctcaa gataactgat gttagtcaga agactttcca ttctggccac agttcagctg 9840 

aggcaatcct ggaattttct ctccgctgca cagttccagt catcccagtt tgtacagttc 9900 

tggcactttt tgggtcaggc cgtgatccaa ggagcagaag ttccagctat ggtcagggag 9960 

tgcctgaccg tcccaactca ctgcactcaa acaaaggcga aaccacaaga gtggcttttg 10020 

ttgaaattgc agtgtggccc agaggggctg caccagtact ggattgacca cgaggcaaca 10080 

ttaatcctca gcaagtgcaa tttgcagcca ttaaattgaa ctaactgata ctacaatgca 10140 

atcagtatca acaagtggtt tggcttggaa gatggagtct aggggctcta caggagtagc 10200 

tactctctaa tggagttgca ttttgaagca ggacactgtg aaaagctggc ctcctaaaga 10260 

ggctgctaaa cattagggtc aattttccag tgcactttct gaagtgtctg cagttcccca 10320 

tgcaaagctg cccaaacata gcacttccaa ttgaatacaa ttatatgcag gcgtactgct 10380 

tcttgccagc actgtccttc tcaaatgaac tcaacaaaca atttcaaagt ctagtagaaa 10440 

gtaacaagct ttgaatgtca ttaaaaagta tatctgcttt cagtagttca gcttatttat 10500 

gcccactaga aacatcttgt acaagctgaa cactggggct ccagattagt ggtaaaacct 10560 

actttataca atcatagaat catagaatgg cctgggttgg aagggacccc aaggatcatg 10620 

aagatccaac acccccgcca caggcagggc caccaacctc cagatctggt actagaccag 10680 

gcagcccagg gctccatcca acctggccat gaacacctcc agggatggag catccacaac 10740 

ctctctgggc agcctgtgcc agcacctcac caccctctct gtgaagaact tttccctgac 10800 

atccaatcta agccttccct ccttgaggtt agatccactc ccccttgtgc tatcactgtc 10860 

tactcttgta aaaagttgat tctcctcctt tttggaaggt tgcaatgagg tctccttgca 10920 

gccttcttct cttctgcagg atgaacaagc ccagctccct cagcctgtct ttataggaga 10980 

ggtgctccag ccctctgatc atctttgtgg ccctcctctg gacccgctcc aagagctcca 11040

catctttcct gtactggggg ccccaggcct gaatgcagta ctccagatgg ggcctcaaaa 11100 

gagcagagta aagagggaca atcaccttcc tcaccctgct ggccagccct cttctgatgg 11160 

agccctggat acaactggct ttctgagctg caacttctcc ttatcagttc cactattaaa 11220 

acaggaacaa tacaacaggt gctgatggcc agtgcagagt ttttcacact tcttcatttc 11280 

ggtagatctt agatgaggaa cgttgaagtt gtgcttctgc gtgtgcttct tcctcctcaa 11340 

atactcctgc ctgatacctc accccacctg ccactgaatg gctccatggc cccctgcagc 11400 

cagggccctg atgaacccgg cactgcttca gatgctgttt aatagcacag tatgaccaag 11460 
ttgcacctat


gaatacacaa 


acaatgtgtt 


aatttgcatt 


gtcaggaaat 


ggtttagtaa 


tggctgtttt 


tatggctgtt 


agtagtggta 


aatcaagact 


gtagatattg 


caacagacta 


tacttcccac 


attgtataag 


aaatttggca 


atttctgtat 


actcaagagg 


gcgtttttga 


tgggaggaag 


ttaaaagaag 


aggcaggtgc 


acactggcaa 


catgaggtct 


ttgctaatct 


tagggtgcga 


tctgcctcag 


acccacagcc 


ctcagatgag 


gagaatcagc 


ctgtttagct 


ctcaagagga 


gtttggcaac 


cagtttcaga 


tgatccagca 


gatctttaac 


ctgtttagca 


ccctgctgga 


taagttttac 


accgagctgt 


tgatccaggg 


cgtgggcgtg 


accgagaccc 


tgaggaagta 


ctttcagagg 


atcaccctgt 


cttgggaagt 


cgtgagggct 


gagatcatga 


aaaacttaac 


gtctaaggaa


taaaaagtct 


acatgataag 


atacattgat 


gagtttggac 


gctttatttg 


tgaaatttgt 


gatgctattg 


aacaagttaa 


caacaacaat 


tgcattcatt 


aggtttttta 


aagcaagtaa 


aacctctaca 



gcggccgc 



<210> 7 

<211> 11945 

<212> DNA 

<213> Gallus gallus 

<220> 

<221> misc_feature

<222> (1)..(237)

<223> 5prime matrix attachment 



gcatccttca 


gcacttgaga 


agaagagcca 


11520 


ttctgccaat 


taaaacttgt 


ttatctacca 


11580 


cactgatgat 


gaacaatggc 


tatgcagtaa 


11640 


taaaattcct 


ctgtggctta 


gccaatgtgg 


11700 


agtttagagc 


aatgtttgaa 


gtgttgggaa 


11760 


caactgtaga 


acagaggaat 


caaaaggggg 


11820 


aagagagctt 


gcagtcccgc 


tgtgtgtacg 


11880 


tggtgctttg 


cttcctgccc 


ctggctgcct 


11940 


tgggcagcag 


gaggaccctg 


atgctgctgg 



12000 


gcctgaagga 


taggcacgat 


tttggctttc 


12060 


aggctgagac 


catccctgtg 


ctgcacgaga 


12120 


ccaaggatag 


cagcgctgct 


tgggatgaga 


12180 


accagcagct 


gaacgatctg 


gaggcttgcg 


12240 


ctctgatgaa 


ggaggatagc 


atcctggctg 


12300 


acctgaagga 


gaagaagtac 


agcccctgcg 


12360 


ggagctttag 


cctgagcacc 


aacctgcaag 


12420 


agagtcgggg 


cggccggccg 


cttcgagcag 


12480 




tagaatgcag 


tgaaaaaaat 


12540 


ctttatttgt 


aaccattata 


agctgcaata 


12600 


ttatgtttca 


ggttcagggg 


gaggtgtggg 


12660 


aatgtggtaa 


aatcgataag 


gatccgtcga 


12720 
12728 



region (MAR) 



<220>

<221> misc_feature

<222> (261)..(1564)

<223> 5prime matrix attachment region (MAR)



<220> 

<221> misc_feature 

<222> (1565)..(1912)

<223> 5prime matrix attachment region (MAR)



<220> 

<221> misc_feature 

<222> (1930)..(2012)

<223> 5prime matrix attachment region (MAR)



<220> 

<221> misc_feature 

<222> (2013)..(2671)

<223> Intrinsically Curved DNA



<220> 

<221> misc_feature

<222> (5848)..(5934)

<223> Transcription Enhancer 



<220> 

<221> misc_feature

<222> (9160)..(9325)

<223> Transcription Enhancer 



<220> 

<221> misc_feature 

<222> (9326)..(9626)

<223> Negative Regulatory Element 



<220> 

<221> misc_feature 

<222> (9621)..(9660)

<223> Hormone Response Element 



<220> 

<221> misc_feature 

<222> (9680)..(10060)

<223> Hormone Response Element 



<220> 

<221> misc_feature 

<222> (10576)..(10821)

<223> Chicken CR1 Repeat

<220>

<221> misc_feature

<222> (10926)..(11193)

<223> Chicken CR1 Repeat 

<220> 

<221> misc_feature 

<222> (11424)..(11938)

<223> Proximal promoter and lysozyme signal peptide 



<400> 7 
tgccgccttc 


tttgatattc 


actctgttgt 


atttcatctc ttcttgccga 


tgaaaggata 


60 


taacagtctg 


tataacagtc 


tgtgaggaaa 


tacttggtat ttcttctgat 


cagtgttttt 


120 


ataagtaatg 


ttgaatattg 


gataaggctg 


tgtgtccttt gtcttgggag 


acaaagccca 


180 


cagcaggtgg 


tggttggggt 


ggtggcagct 


cagtgacagg agaggttttt 


ttgcctgttt 


240 


tttttttttt 


tttttttttt 


aagtaaggtg 


ttcttttttc ttagtaaatt 


ttctactgga 


300 


ctgtatgttt 


tgacaggtca 


gaaacatttc 


ttcaaaagaa gaaccttttg 


gaaactgtac 


360 


agcccttttc 


tttcattccc 


tttttgcttt 


ctgtgccaat gcctttggtt 


ctgattgcat 


420 


tatggaaaac 


gttgatcgga 


acttgaggtt 


tttatttata gtgtggcttg 


aaagcttgga 


480


tagctgttgt 


tacacgagat 


accttattaa 


gtttaggcca gcttgatgct 


ttattttttc 


540 


cctttgaagt 


agtgagcgtt 


ctctggtttt 


tttcctttga aactggtgag 


gcttagattt 


600 


ttctaatggg 


attttttacc 


tgatgatcta 


gttgcatacc caaatgcttg 


taaatgtttt 


660 


cctagttaac 


atgttgataa 


cttcggattt 


acatgttgta tatacttgtc 


atctgtgttt 


720 


CT.agraaaaa 


4* ^ 4" ■a 4* rr/T/« o 4* 

tataLygcaL 




af*rrt"a A-t'tTT' tcratttcctt 


tttttttatc 


780 


tctatgctct 


gtgtgtacag 


gtcaaacaga 


cttcactcct atttttattt 


atagaatttt 


840 


atatgcagtc 


tgtcgttggt 


tcttgtgttg 


taaggataca gccttaaatt 


tcctagagcg


900 


atgctcagta 


aggcgggttg 


tcacatgggt 


tcaaatgtaa aacgggcacg 


tttggctgct 


960 


gccttcccga 


gatccaggac 


actaaactgc 


ttctgcactg aggtataaat 


cgcttcagat 


1020 


cccagggaag 


tgcagatcca 


cgtgcatatt 


cttaaagaag aatgaatact 


ttctaaaata 


1080 


ttttggcata 


ggaagcaagc 


tgcatggatt 


tgtttgggac ttaaattatt 


ttggtaacgg 


1140 


agtgcatagg 


ttttaaacac 


agttgcagca 


tgctaacgag tcacagcgtt 


tatgcagaag 


1200 


tgatgcctgg 


atgcctgttg 


cagctgttta 


cggcactgcc ttgcagtgag 


cattgcagat 


1260 


aggggtgggg 


tgctttgtgt 


cgtgttccca 


cacgctgcca cacagccacc 


tcccggaaca 


1320 


catctcacct 


gctgggtact 


tttcaaacca 


tcttagcagt agtagatgag 


ttactatgaa 


1380 
acagagaagt tcctcagttg gatattctca tgggatgtct tttttcccat gttgggcaaa 1440

gtatgataaa gcatctctat ttgtaaatta tgcacttgtt agttcctgaa tcctttctat 1500

agcaccactt attgcagcag gtgtaggctc tggtgtggcc tgtgtctgtg cttcaatctt 1560 

ttaaagcttc tttggaaata cactgacttg attgaagtct cttgaagata gtaaacagta 1620 

cttacctttg atcccaatga aatcgagcat ttcagttgta aaagaattcc gcctattcat 1680 

accatgtaat gtaattttac acccccagtg ctgacacttt ggaatatatt caagtaatag 1740 

actttggcct caccctcttg tgtactgtat tttgtaatag aaaatatttt aaactgtgca 1800 

tatgattatt acattatgaa agagacattc tgctgatctt caaatgtaag aaaatgagga 1860 

gtgcgtgtgc ttttataaat acaagtgatt gcaaattagt gcaggtgtcc ttaaaaaaaa 1920 

aaaaaaaaag taatataaaa aggaccaggt gttttacaag tgaaatacat tcctatttgg 1980 

taaacagtta catttttatg aagattacca gcgctgctga ctttctaaac ataaggctgt 2040 

attgtcttcc tgtaccattg catttcctca ttcccaattt gcacaaggat gtctgggtaa 2100 

actattcaag aaatggcttt gaaatacagc atgggagctt gtctgagttg gaatgcagag 2160 

ttgcactgca aaatgtcagg aaatggatgt ctctcagaat gcccaactcc aaaggatttt 2220 

atatgtgtat atagtaagca gtttcctgat tccagcaggc caaagagtct gctgaatgtt 2280 

gtgttgccgg agacctgtat ttctcaacaa ggtaagatgg tatcctagca actgcggatt 2340 

ttaatacatt ttcagcagaa gtacttagtt aatctctacc tttagggatc gtttcatcat 2400 

ttttagatgt tatacttgaa atactgcata acttttagct ttcatgggtt cctttttttc 2460 

agcctttagg agactgttaa gcaatttgct gtccaacttt tgtgttggtc ttaaactgca 2520 

atagtagttt accttgtatt gaagaaataa agaccatttt tatattaaaa aatacttttg 2580 

tctgtcttca ttttgacttg tctgatatcc ttgcagtgcc cattatgtca gttctgtcag 2640 

atattcagac atcaaaactt aacgtgagct cagtggagtt acagctgcgg ttttgatgct 2700 

gttattattt ctgaaactag aaatgatgtt gtcttcatct gctcatcaaa cacttcatgc 2760 

agagtgtaag gctagtgaga aatgcataca tttattgata cttttttaaa gtcaactttt 2820 

tatcagattt ttttttcatt tggaaatata ttgttttcta gactgcatag cttctgaatc 2880 

tgaaatgcag tctgattggc atgaagaagc acagcactct tcatcttact taaacttcat 2940 

tttggaatga aggaagttaa gcaagggcac aggtccatga aatagagaca gtgcgctcag 3000 

gagaaagtga acctggattt ctttggctag tgttctaaat ctgtagtgag gaaagtaaca 3060 

cccgattcct tgaaagggct ccagctttaa tgcttccaaa ttgaaggtgg caggcaactt 3120 
ggccactggt tatttactgc attatgtctc agtttcgcag ctaacctggc ttctccacta 3180

ttgagcatgg actatagcct ggcttcagag gccaggtgaa ggttgggatg ggtggaagga 3240 

gtgctgggct gtggctgggg ggactgtggg gactccaagc tgagcttggg gtgggcagca 3300 

cagggaaaag tgtgggtaac tatttttaag tactgtgttg caaacgtctc atctgcaaat 3360 

acgtagggtg tgtactctcg aagattaaca gtgtgggttc agtaatatat ggatgaattc 3420 

acagtggaag cattcaaggg tagatcatct aacgacacca gatcatcaag ctatgattgg 3480 

aagcggtatc agaagagcga ggaaggtaag cagtcttcat atgttttccc tccacgtaaa 3540 

gcagtctggg aaagtagcac cccttgagca gagacaagga aataattcag gagcatgtgc 3600 

taggagaact ttcttgctga attctacttg caagagcttt gatgcctggc ttctggtgcc 3660 

ttctgcagca cctgcaaggc ccagagcctg tggtgagctg gagggaaaga ttctgctcaa 3720 

gtccaagctt cagcaggtca ttgtctttgc ttcttccccc agcactgtgc agcagagtgg 3780

aactgatgtc gaagcctcct gtccactacc tgttgctgca ggcagactgc tctcagaaaa 3840 

agagagctaa ctctatgcca tagtctgaag gtaaaatggg ttttaaaaaa gaaaacacaa 3900

aggcaaaacc ggctgcccca tgagaagaaa gcagtggtaa acatggtaga aaaggtgcag 3960 

aagcccccag gcagtgtgac aggcccctcc tgccacctag aggcgggaac aagcttccct 4020 

gcctagggct ctgcccgcga agtgcgtgtt tctttggtgg gttttgtttg gcgtttggtt 4080 

ttgagattta gacacaaggg aagcctgaaa ggaggtgttg ggcactattt tggtttgtaa 4140 

agcctgtact tcaaatatat attttgtgag ggagtgtagc gaattggcca atttaaaata 4200 

aagttgcaag agattgaagg ctgagtagtt gagagggtaa cacgtttaat gagatcttct 4260 

gaaactactg cttctaaaca cttgtttgag tggtgagacc ttggataggt gagtgctctt 4320 

gttacatgtc tgatgcactt gcttgtcctt ttccatccac atccatgcat tccacatcca 4380 

cgcatttgtc acttatccca tatctgtcat atctgacata cctgtctctt cgtcacttgg 4440 

tcagaagaaa cagatgtgat aatccccagc cgccccaagt ttgagaagat ggcagttgct 4500

tctttccctt tttcctgcta agtaaggatt ttctcctggc tttgacacct cacgaaatag 4560 

tcttcctgcc ttacattctg ggcattattt caaatatctt tggagtgcgc tgctctcaag 4620 

tttgtgtctt cctactctta gagtgaatgc tcttagagtg aaagagaagg aagagaagat 4680 

gttggccgca gttctctgat gaacacacct ctgaataatg gccaaaggtg ggtgggtttc 4740 

tctgaggaac gggcagcgtt tgcctctgaa agcaaggagc tctgcggagt tgcagttatt 4800 

ttgcaactga tggtggaact ggtgcttaaa gcagattccc taggttccct gctacttctt 4860 
ttccttcttg


gcagtcagtt 


tatttctgac agacaaacag ccacccccac tgcaggctta 


4920 


gaaagtatgt 


ggctctgcct 


gggtgtgtta 


cagctctgcc ctggtgaaag gggattaaaa 


4980


cgggcaccat 


tcatcccaaa 


caggatcctc attcatggat caagctgtaa ggaacttggg 


5040 


ctccaacctc 


aaaacattaa 


ttggagtacg aatgtaatta aaactgcatt ctcgcattcc 


5100 


taagtcattt 


agtctggact 


ctgcagcatg taggtcggca gctcccactt tctcaaagac 



5160 


cactgatgga 


ggagtagtaa 


aaatggagac cgattcagaa caaccaacgg agtgttgccg 


5220 


aagaaactga 


tggaaataat 


gcatgaattg tgtggtggac atttttttta aatacataaa 


5280 


ctacttcaaa 


tgaggtcgga 


gaaggtcagt gttttattag cagccataaa accaggtgag 


5340 


cgagtaccat 


ttttctctac 


aagaaaaacg 


attctgagct ctgcgtaagt ataagttctc


5400 


catagcggct 


gaagctcccc 


cctggctgcc 


tgccatctca gctggagtgc agtgccattt


5460 


ccttggggtt 


tctctcacag 


cagtaatggg 


acaatacttc acaaaaattc tttcttttcc 


5520 


tgtcatgtgg 


gatccctact 


gtgccctcct 


ggttttacgt taccccctga ctgttccatt 


5580 


cagcggtttg 


gaaagagaaa 


aagaatttgg 


aaataaaaca tgtctacgtt atcacctcct 


5640 


ccagcatttt 


ggtttttaat 


tatgtcaata 


actggcttag atttggaaat gagagggggt 


5700 


tgggtgtatt 


accgaggaac 


aaaggaaggc 


ttatataaac tcaagtcttt tatttagaga 


5760 


actggcaagc 


tgtcaaaaac 


aaaaaggcct 


taccaccaaa ttaagtgaat agccgctata 


5820 


gccagcaggg 


ccagcacgag 


ggatggtgca 


ctgctggcac tatgccacgg cctgcttgtg 


5880 


actctgagag 


caactgcttt 


ggaaatgaca 


gcacttggtg caatttcctt tgtttcagaa


5940 


tgcgtagagc 


gtgtgcttgg 


cgacagtttt 


tctagttagg ccacttcttt tttccttctc 


6000 


tcctcattct 


cctaagcatg 


tctccatgct 


ggtaatccca gtcaagtgaa cgttcaaaca 


6060 


atgaatccat 


cactgtagga 


ttctcgtggt 


gatcaaatct ttgtgtgagg tctataaaat 


6120 


atggaagctt 


atttattttt 


cgttcttcca 


tatcagtctt ctctatgaca attcacatcc 


6180 


accacagcaa 


attaaaggtg 


aaggaggctg 


gtgggatgaa gagggtcttc tagctttacg 


6240 


ttcttccttg 


caaggccaca 


ggaaaatgct 


gagagctgta gaatacagcc tggggtaaga 


6300 


agttcagtct 


cctgctggga 


cagctaaccg 


catcttataa ccccttctga gactcatctt 


6360 


aggaccaaat 


agggtctatc 


tggggttttt 


gttcctgctg ttcctcctgg aaggctatct 


6420 


cactatttca 


ctgctcccac 


ggttacaaac 


caaagataca gcctgaattt tttctaggcc 


6480


acattacata 


aatttgacct 


ggtaccaata 


ttgttctcta tatagttatt tccttcccca 


6540 


ctgtgtttaa 


ccccttaagg 


cattcagaac 


aactagaatc atagaatggt ttggattgga


6600 
aggggcctta


aacatcatcc 


atttccaacc 


ctctgccatg 


ggctgcttgc 


cacccactgg 


6660 


ctcaggctgc 


ccagggcccc 


atccagcctg 


gccttgagca 


cctccaggga 


tggggcaccc 


6720 


acagcttctc 


tgggcagcct 


gtgccaacac 


ctcaccactc 


tctgggtaaa 


gaattctctt 


6780 


ttaacatcta 


atctaaatct 


cttctctttt 


agtttaaagc 


cattcctctt 


tttcccgttg 


6840 


ctatctgtcc 


aagaaatgtg


tattggtctc 


cctcctgctt 


ataagcagga 


agtactggaa 


6900 


ggctgcagtg 


aggtctcccc 


acagccttct 


cttctccagg 


ctgaacaagc 


ccagctcctt 


6960 


cagcctgtct


tcgtaggaga 


tcatcttagt 


ggccctcctc 


tggacccatt 


ccaacagttc 


7020 


cacggctttc 


ttgtggagcc 


ccaggtctgg 


atgcagtact 


tcagatgggg 


ccttacaaag 


7080 


gcagagcaga 


tggggacaat 


cgcttacccc 


tccctgctgg 


ctgcccctgt 


tttgatgcag 


7140 


cccagggtac 


tgttggcctt 


tcaggctccc 


agaccccttg 


ctgatttgtg 


tcaagctttt 


7200 


catccaccag 


aacccacgct 


tcctggttaa 


tacttctgcc 


ctcacttctg 


taagcttgtt 


7260 


tcaggagact 


tccattcttt 


aggacagact 


gtgttacacc 


tacctgccct 


attcttgcat 


7320 


atatacattt 


cagttcatgt 


ttcctgtaac 


aggacagaat 


atgtattcct 


ctaacaaaaa 


7380 


tacatgcaga 


attcctagtg 


ccatctcagt 


agggttttca 


tggcagtatt 


agcacatagt 


7440 


caatttgctg 


caagtacctt 


ccaagctgcg 


gcctcccata 


aatcctgtat 


ttgggatcag 


7500 


ttaccttttg 


gggtaagctt 


ttgtatctgc 


agagaccctg 


ggggttctga 


tgtgcttcag 


7560 


ctctgctctg 


ttctgactgc 


accattttct 


agatcaccca 


gttgttcctg 


tacaacttcc 


7620 


ttgtcctcca 


tcctttccca 


gcttgtatct 


ttgacaaata 


caggcctatt 


tttgtgtttg 


7680 


cttcagcagc 


catttaattc 


ttcagtgtca 


tcttgttctg 


ttgatgccac 


tggaacagga 


7740 


ttttcagcag 


tcttgcaaag 


aacatctagc 


tgaaaacttt 


ctgccattca 


atattcttac 


7800 


cagttcttct 


tgtttgaggt 


gagccataaa 


ttactagaac 


ttcgtcactg 


acaagtttat 


7860 


gcattttatt 


acttctatta 


tgtacttact 


ttgacataac 


acagacacgc 


acatattttg 


7920 


ctgggatttc 


cacagtgtct 


ctgtgtcctt 


cacatggttt 


tactgtcata 


cttccgttat 


7980 


aaccttggca 


atctgcccag 


ctgcccatca 


caagaaaaga 


gattcctttt 


ttattacttc 


8040 


tcttcagcca 


ataaacaaaa 


tgtgagaagc 


ccaaacaaga 


acttgtgggg 


caggctgcca 


8100 


tcaagggaga 


gacagctgaa 


gggttgtgta 


gctcaataga 


attaagaaat 


aataaagctg 


8160 


tgtcagacag 


ttttgcctga 


tttatacagg 


cacgccccaa 


gccagagagg 


ctgtctgcca 


8220 


aggccacctt 


gcagtccttg 


gtttgtaaga 


taagtcatag 


gtaacttttc 


tggtgaattg 


8280 


cgtggagaat 


catgatggca 


gttcttgctg 


tttactatgg 


taagatgcta 


aaataggaga 


8340 
cagcaaagta


acacttgctg 


ctgtaggtgc 


tctgctatcc 


agacagcgat 


ggcactcgca 


8400 


caccaagatg 


agggatgctc 


ccagctgacg 


gatgctgggg 


cagtaacagt 


gggtcccatg 


8460 


ctgcctgctc 


attagcatca 


cctcagccct 


caccagccca 


tcagaaggat 


catcccaagc 


8520 


tgaggaaagt 


tgctcatctt 


cttcacatca 


tcaaaccttt 


ggcctgactg 


atgcctcccg 


8580 


gatgcttaaa 


tgtggtcact 


gacatcttta 


tttttctatg 


atttcaagtc 


agaacctccg 


8640 


gatcaggagg 


gaacacatag 


tgggaatgta 


ccctcagctc 


caaggccaga 


tcttccttca 


8700 


atgatcatgc 


atgctactta 


ggaaggtgtg 


tgtgtgtgaa 


tgtagaattg 


cctttgttat 


8760 


tttttcttcc 


tgctgtcagg 


aacattttga 


ataccagaga 


aaaagaaaag 


tgctcttctt 


8820 


ggcatgggag 


gagttgtcac 


acttgcaaaa 


taaaggatgc 


agtcccaaat 


gttcataatc 


8880 


tcagggtctg 


aaggaggatc 


agaaactgtg 


tatacaattt 


caggcttctc 


tgaatgcagc 


8940 


ttttgaaagc 


tgttcctggc 


cgaggcagta 


ctagtcagaa 


ccctcggaaa 


caggaacaaa 


9000 


tgtcttcaag 


gtgcagcagg 


aggaaacacc 


ttgcccatca 


tgaaagtgaa 


taaccactgc 


9060 


cgctgaagga 


atccagctcc 


tgtttgagca 


ggtgctgcac 


actcccacac 


tgaaacaaca 


9120 


gttcattttt 


ataggacttc 


caggaaggat 


cttcttctta 


agcttcttaa 


ttatggtaca 


9180 


tctccagttg 


gcagatgact 


atgactactg 


acaggagaat 


gaggaactag 


ctgggaatat 


9240 


ttctgtttga 


ccaccatgga 


gtcacccatt 


tctttactgg 


tatttggaaa 


taataattct 


9300 


gaattgcaaa 


gcaggagtta 


gcgaagatct 


tcatttcttc 


catgttggtg 


acagcacagt 


9360 


tctggctatg 


aaagtctgct 


tacaaggaag 


aggataaaaa 


tcatagggat 


aataaatcta 


9420 


agtttgaaga 


caatgaggtt 


ttagctgcat 


ttgacatgaa 


gaaattgaga 


cctctactgg 


9480 


atagctatgg 


tatttacgtg 


tctttttgct 


tagttactta 


ttgaccccag 


ctgaggtcaa 


9540 


gtatgaactc 


aggtctctcg 


ggctactggc 


atggattgat 


tacatacaac 


tgtaatttta 


9600 


gcagtgattt 


agggtttatg 


agtacttttg 


cagtaaatca 


tagggttagt 


aatgttaatc 


9660 


tcagggaaaa 


aaaaaaaaag 


ccaaccctga 


cagacatccc 


agctcaggtg 


gaaatcaagg 


9720 


atcacagctc 


agtgcggtcc 


cagagaacac 


agggactctt 


ctcttaggac 


ctttatgtac 


9780 


agggcctcaa 


gataactgat 


gttagtcaga 


agactttcca 


ttctggccac 


agttcagctg 


9840 


aggcaatcct 


ggaattttct 


ctccgctgca 


cagttccagt 


catcccagtt 


tgtacagttc


9900 


tggcactttt 


tgggtcaggc 


cgtgatccaa 


ggagcagaag 


ttccagctat 


ggtcagggag 


9960 


tgcctgaccg 


tcccaactca 


ctgcactcaa 


acaaaggcga 


aaccacaaga 


gtggcttttg 


10020 


ttgaaattgc 


agtgtggccc 


agaggggctg 


caccagtact 


ggattgacca 


cgaggcaaca 


10080 
ttaatcctca gcaagtgcaa tttgcagcca ttaaattgaa ctaactgata ctacaatgca 10140

atcagtatca acaagtggtt tggcttggaa gatggagtct aggggctcta caggagtagc 10200 

tactctctaa tggagttgca ttttgaagca ggacactgtg aaaagctggc ctcctaaaga 10260 

ggctgctaaa cattagggtc aattttccag tgcactttct gaagtgtctg cagttcccca 10320 

tgcaaagctg cccaaacata gcacttccaa ttgaatacaa ttatatgcag gcgtactgct 10380 

tcttgccagc actgtccttc tcaaatgaac tcaacaaaca atttcaaagt ctagtagaaa 10440 

gtaacaagct ttgaatgtca ttaaaaagta tatctgcttt cagtagttca gcttatttat 10500 

gcccactaga aacatcttgt acaagctgaa cactggggct ccagattagt ggtaaaacct 10560 

actttataca atcatagaat catagaatgg cctgggttgg aagggacccc aaggatcatg 10620 

aagatccaac acccccgcca caggcagggc caccaacctc cagatctggt actagaccag 10680 

gcagcccagg gctccatcca acctggccat gaacacctcc agggatggag catccacaac 10740 

ctctctgggc agcctgtgcc agcacctcac caccctctct gtgaagaact tttccctgac 10800 

atccaatcta agccttccct ccttgaggtt agatccactc ccccttgtgc tatcactgtc 10860 

tactcttgta aaaagttgat tctcctcctt tttggaaggt tgcaatgagg tctccttgca 10920

gccttcttct cttctgcagg atgaacaagc ccagctccct cagcctgtct ttataggaga 10980 

ggtgctccag ccctctgatc atctttgtgg ccctcctctg gacccgctcc aagagctcca 11040 

catctttcct gtactggggg ccccaggcct gaatgcagta ctccagatgg ggcctcaaaa 11100 

gagcagagta aagagggaca atcaccttcc tcaccctgct ggccagccct cttctgatgg 11160 

agccctggat acaactggct ttctgagctg caacttctcc ttatcagttc cactattaaa 11220 

acaggaacaa tacaacaggt gctgatggcc agtgcagagt ttttcacact tcttcatttc 11280 

ggtagatctt agatgaggaa cgttgaagtt gtgcttctgc gtgtgcttct tcctcctcaa 11340 

atactcctgc ctgatacctc accccacctg ccactgaatg gctccatggc cccctgcagc 11400

cagggccctg atgaacccgg cactgcttca gatgctgttt aatagcacag tatgaccaag 11460 

ttgcacctat gaatacacaa acaatgtgtt gcatccttca gcacttgaga agaagagcca 11520 

aatttgcatt gtcaggaaat ggtttagtaa ttctgccaat taaaacttgt ttatctacca 11580 

tggctgtttt tatggctgtt agtagtggta cactgatgat gaacaatggc tatgcagtaa 11640 

aatcaagact gtagatattg caacagacta taaaattcct ctgtggctta gccaatgtgg 11700 

tacttcccac attgtataag aaatttggca agtttagagc aatgtttgaa gtgttgggaa 11760 

atttctgtat actcaagagg gcgtttttga caactgtaga acagaggaat caaaaggggg 11820 
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tgggaggaag ttaaaagaag aggcaggtgc aagagagctt gcagtcccgc tgtgtgtacg 11880. 
acactggcaa catgaggtct ttgctaatct tggtgctttg- cttcctgccc ctggctgcct 11940 
taggg 11945 

<210> 8 

<211> 285 

<212> DNA 

<213> SV40 

<220> 

<221> misc_feature 

<222> (1)..(285) 

<223> SV40 Polyadenylation Sequence 



<400> 8

aaagtctaga gtcggggcgg ccggccgctt cgagcagaca tgataagata cattgatgag 60

tttggacaaa ccacaactag aatgcagtga aaaaaatgct ttatttgtga aatttgtgat 120

gctattgctt tatttgtaac cattataagc tgcaataaac aagttaacaa caacaattgc 180

attcatttta tgtttcaggt tcagggggag gtgtgggagg ttttttaaag caagtaaaac 240 

ctctacaaat gtggtaaaat cgataaggat ccgtcgagcg gccgc 285 

<210> 9 

<211> 5972 

<212> DNA 

<213> Gallus gallus 

<220> 

<221> misc_feature

<222> (1)..(5972)

<223> Lysozyme 3prime domain

<400> 9

cgcgtggtag gtggcggggg gttcccagga gagcccccag cgcggacggc agcgccgtca 60

ctcaccgctc cgtctccctc cgcccagggt cgcctggcgc aaccgctgca agggcaccga 120 

cgtccaggcg tggatcagag gctgccggct gtgaggagct gccgcgcccg gcccgcccgc 180 

tgcacagccg gccgctttgc gagcgcgacg ctacccgctt ggcagtttta aacgcatccc 240 

tcattaaaac gactatacgc aaacgccttc ccgtcggtcc gcgtctcttt ccgccgccag 300 

ggcgacactc gcggggaggg cgggaagggg gccgggcggg agcccgcggc caaccgtcgc 360 

cccgtgacgg caccgccccg cccccgtgac gcggtgcggg cgccggggcc gtggggctga 420 

gcgctgcggc ggggccgggc cgggccgggg cgggagctga gcgcggcgcg gctgcgggcg 480

gcgccccctc cggtgcaata tgttcaagag aatggctgag ttcgggcctg actccggggg 540

cagggtgaag gtgcggcgcg ggcggaggga cggggcgggc gcggggccgc ccggcgggtg 600 

ccggggcctc tgccggcccg cccggctcgg gctgctgcgg cgcttacggg cgcgcttctc 660 

gccgctgccg cttctcttct ctcccgcgca agggcgtcac catcgtgaag ccggtagtgt 720 

acgggaacgt ggcgcggtac ttcgggaaga agagggagga ggacgggcac acgcatcagt 780 

ggacggttta cgtgaagccc tacaggaacg aggtagggcc cgagcgcgtc ggccgccgtt 840 

ctcggagcgc cggagccgtc agcgccgcgc ctgggtgcgc tgtgggacac agcgagcttc 900 

tctcgtagga catgtccgcc tacgtgaaaa aaatccagtt caagctgcac gagagctacg 960 

ggaatcctct ccgaggtggg tgttgcgtcg gggggtttgc tccgctcggt cccgctgagg 1020 

ctcgtcgccc tcatctttct ttcgtgccgc agtcgttacc aaaccgccgt acgagatcac 1080 

cgaaacgggc tggggcgaat ttgaaatcat catcaagata tttttcattg atccaaacga 1140 

gcgacccgta agtacgctca gcttctcgta gtgcttcccc cgtcctggcg gcccggggct 1200 

gggctgctcg ctgctgccgg tcacagtccc gccagccgcg gagctgactg agctcccttt 1260 

cccgggacgt gtgctctgtg ttcggtcagc gaggctatcg ggagggcttt ggctgcattt 1320 

ggcttctctg gcgcttagcg caggagcacg ttgtgctacg cctgaactac agctgtgaga 1380 

aggccgtgga aaccgctctc aaactgattt attggcgaaa tggctctaaa ctaaatcgtc 1440 

tcctctcttt ggaaatgctt tagagaaggt ctctgtggta gttcttatgc atctatccta 1500 

aagcacttgg ccagacaatt taaagacatc aagcagcatt tatagcaggc acgtttaata 1560 

acgaatactg aatttaagta actctgctca cgttgtatga cgtttatttt cgtattcctg 1620 

aaagccatta aaatcctgtg cagttgttta gtaagaacag ctgccactgt tttgtatcta 1680 

ggagataact ggtgtttccc tacagttctc aagctgataa aactctgtct ttgtatctag 1740 

gtaaccctgt atcacttgct gaagcttttt cagtctgaca ccaatgcaat cctgggaaag 1800 

aaaactgtag tttctgaatt ctatgatgaa atggtatgaa aattttaatg tcaaccgagc 1860 

ctgactttat ttaaaaaaaa ttattgatgg tgctgtgtat tttggtcctt ccttagatat 1920 

ttcaagatcc tactgccatg atgcagcaac tgctaacgac gtcccgtcag ctgacacttg 1980 

gtgcttacaa gcatgaaaca gagtgtaagt gcaaaatgag gataccttcg ccgaccgtca 2040 

ttcactacta atgttttctg tgggatgtga tcgtacagtg agtttggctg tgtgaaattt 2100 

gaatagcttg gtattggcag tgatgacgtg atcgatgcct tgcttatcat gtttgaaatg 2160 

aagtagaata aatgcagcct gctttatttg agatagtttg gttcatttta tggaatgcaa 2220 
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gcaaagatta tacttcctca 


ctgaattgca 


ctgtccaaag 


gtgtgaaatg 


tgtggggatc 


2280 


tggaggaccg tgaccgaggg 


acattggatc 


gctatctccc 


atttcttttg 


ctgttaccag 


2340 


ttcagatttt cttttcacct 


agtctttaat 


tcccagggtt 


ttgttttttc 


cttggtcata 


2400 


gtttttgttt ttcactctgg 


caaatgatgt 


tgtgaattac 


actgcttcag 


ccacaaaact 


2460 


gatggactga atgaggtcat 


caaacaaact 


tttcttcttc 


cgtatttcct 


tttttttccc 


2520 


ccacttatca tttttactgc 


tgttgttgag 


tctgtaaggc 


taaaagtaac 


tgttttgtgc 


2580 


tttttcagga cgtgtgcttt 


ccaaattact 


gccacatata 


taaagaaagg 


ttggaatttt 


2640 


aaagataatt catgtttctt 


cttctttttt 


gccaccacag 


ttgcagatct 


tgaagtaaaa 


2700 


accagggaaa agctggaagc 


tgccaaaaag 


aaaaccagtt 


ttgaaattgc 


tgagcttaaa 


2760 


gaaaggttaa aagcaagtcg 


tgaaaccatc 


aactgcttaa 


agagtgaaat 


cagaaaactc 


2820 


gaagaggatg atcagtctaa 


agatatgtga 


tgagtgttga 


cttggcaggg 


agcctataat 


2880 


gagaatgaaa ggacttcagt 


cgtggagttg 


tatgcgttct 


ctccaattct 


gtaacggaga 


2940 


ctgtatgaat ttcatttgca 


aatcactgca 


gtgtgtgaca 


actgactttt 


tataaatggc 


3000 


agaaaacaag aatgaatgta 


tcctcatttt 


atagttaaaa 


tctatgggta 


tgtactggtt 


3060 


tatttcaagg agaatggatc 


gtagagactt 


ggaggccaga 


ttgctgcttg 


tattgactgc 


3120 


atttgagtgg tgtaggaaca 


ttttgtctat 


ggtcccgtgt 


tagtttacag 


aatgccactg 


3180 


ttcactgttt tgttttgtat 


tttacttttt 


ctactgcaac 


gtcaaggttt 


taaaagttga 


3240 


aaataaaaca tgcaggtttt 


ttttaaatat 


ttttttgtct 


ctatccagtt 


tgggcttcaa 


3300 


gtattattgt taacagcaag 


tcctgattta 


agtcagaggc 


tgaagtgtaa 


tggtattcaa 


3360 


gatgcttaag tctgttgtca 


gcaaaacaaa 


agagaaaact 


tcataaaatc 


aggaagttgg 


3420 


catttctaat aacttcttta 


tcaacagata 


agagtttcta 


gccctgcatc 


tactttcact 


3480 


tatgtagttg atgcctttat 


attttgtgtg 


tttggatgca 


ggaagtgatt 


cctactctgt 


3540 


tatgtagata ttctatttaa 


cacttgtact 


ctgctgtgct 


tagcctttcc 


ccatgaaaat 


3600 


tcagcggctg taaatccccc 


tcttcttttg 


tagcctcata 


cagatggcag 


accctcaggc 


3660 


ttataaaggc' ttgggcatct 


tctttactgc 


tttgagattc 


tgtgttgcag 


taacctctgc 


3720 


cagagaggag aaaagcccca 


caaacctcat 


ccccttcttc 


tatagcaatc 


agtattacta 


3780 


atgctttgag aacagagcac 


tggtttgaaa 


cgtttgataa 


ttagcattta 


acatggcttg 


3840 


gtaaagatgc agaactgaaa 


cagctgtgac 


agtatgaact 


cagtatggag 


acttcattaa 


-3900 


gacaaacagc tgttaaaatc 


aggcatgttt 


cattgaggag 


gacggggcaa 


cttgcaccag 


3960 






WO 03/024199 



PCT/US02/30156 



tggtgcccac acaaatcctt cctggcgctg cagaccaatt tttctggcat tctgactgcc 4020 

gttgctgctg gtcacagaga gcaactattt ttatcagcca caggcaattt gcttgtagta 4080 

ttttccaagt gttgtaggta agtataaatg catcggctcc agagcacttt gagtatactt 4140 

attaaaaaca taaatgaaag acaaattagc tttgcttggg tgcacagaac atttttagtt 4200 

ccagcctgct ttttggtaga agccctcttc tgaggctaga actgactttg acaagtagag 4260 

aaactggcaa cggagctatt gctatcgaag gatccttgtt aacaaagtta atcgtctttt 4320 

aaggtttggt ttattcatta aatttgcttt taagctgtag ctgaaaaaga acgtgctgtc 4380

ttccatgcac caggtggcag ctctgtgcaa agtgctctct ggtctcacca gccttttaat 4440 

tgccgggatt ctggcacgtc tgagagggct cagactggct tcgtttgttt gaacagcgtg 4500 

tactgctttc tgtagacatg gccggtttct ctcctgcagc ttatgaaact gttcacactg 4560 

aacacactgg aacaggttgc ccaaggaggc cgtggatgcc ccatccctgg aggcattcaa 4620

ggccaggctg gatgtggctc tgggcagcct ggtctggtgg ttggcgatcc tgcacatagc 4680 

agcggggttg aaactcgatg atcactgtgg tccttttcaa cccaggctat tctatgattc 4740 

tatgattcaa cagcaaatca tatgtactga gagaggaaac aaacacaagt gctactgttt 4800 

gcaagttttg ttcatttggt aaaagagtca ggttttaaaa ttcaaaatct gtctggtttt 4860 

ggtgtttttt tttttttatt tattatttct ttggggttct ttttgatgct ttatctttct 4920 

ctgccaggac tgtgtgacaa tgggaacgaa aaagaacatg ccaggcactg tcctggattg 4980 

cacacgctgg ttgcactcag tagcaggctc agaactgcca gtctttccac agtattactt 5040 

tctaaaccta attttaatag cgttagtaga cttccatcac tgggcagtgc ttagtgaatg 5100 

ctctgtgtga acgttttact tataagcatg ttggaagttt tgatgttcct ggatgcagta 5160 

gggaaggaca gattagctat gtgaaaagta gattctgagt atcggggtta caaaaagtat 5220 

agaaacgatg agaaattctt gttgtaacta attggaattt ctttaagcgt tcacttatgc 5280 

tacattcata gtatttccat ttaaaagtag gaaaaggtaa aacgtgaaat cgtgtgattt 5340 

tcggatggaa caccgccttc ctatgcacct gaccaacttc cagaggaaaa gcctattgaa 5400 

agccgagatt aagccaccaa aagaactcat ttgcattgga atatgtagta tttgccctct 5460 

tcctcccggg taattactat actttatagg gtgcttatat gttaaatgag tggctggcac 5520 

tttttattct cacagctgtg gggaattctg tcctctagga cagaaacaat tttaatctgt 5580

tccactggtg actgctttgt cagcacttcc acctgaagag atcaatacac tcttcaatgt 5640 

ctagttctgc aacacttggc aaacctcaca tcttatttca tactctcttc atgcctatgc 5700 




ttattaaagc aataatctgg gtaatttttg ttttaatcac tgtcctgacc ccagtgatga 5760 

ccgtgtccca cctaaagctc aattcaggtc ctgaatctct tcaactctct atagctaaca 5820

tgaagaatct tcaaaagtta ggtctgaggg acttaaggct aactgtagat gttgttgcct 5880 

ggtttctgtg ctgaaggccg tgtagtagtt agagcattca acctctagaa gaagcttggc 5940 

cagctggtcg acctgcagat ccggccctcg ag 5972 

<210> 10 

<211> 18391 

<212> DNA 

<213> Gallus gallus

<220>

<221> misc_feature

<222> (1)..(237)

<223> 5prime matrix (scaffold) attachment region (MAR)
<220>

<221> misc_feature

<222> (261)..(1564)

<223> 5prime matrix (scaffold) attachment region (MAR)
<220>

<221> misc_feature

<222> (1565)..(1912)

<223> 5prime matrix (scaffold) attachment region (MAR)
<220>

<221> misc_feature

<222> (1930)..(2012)

<223> 5prime matrix (scaffold) attachment region (MAR)
<220>

<221> misc_feature

<222> (2013)..(2671)

<223> Intrinsically curved DNA

<220>

<221> misc_feature

<222> (5848)..(5934)

<223> Transcription enhancer

<220>

<221> misc_feature

<222> (9160)..(9325)

<223> Transcription enhancer






<220>

<221> misc_feature

<222> (9326)..(9626)

<223> Negative regulatory element

<220>

<221> misc_feature

<222> (9621)..(9660)

<223> Hormone response element

<220>

<221> misc_feature

<222> (9680)..(10060)

<223> Hormone response element

<220>

<221> misc_feature

<222> (10576)..(10821)

<223> Chicken CR1 Repeat Sequence

<220>

<221> misc_feature

<222> (10926)..(11193)

<223> Chicken CR1 Repeat Sequence

<220>

<221> misc_feature

<222> (11424)..(11938)

<223> Lysozyme Proximal Promoter and Lysozyme Signal Peptide
<220>

<221> misc_feature

<222> (11946)..(12443)

<223> human interferon alpha 2b codon-optimized for expression in chickens

<220>

<221> misc_feature

<222> (12464)..(18391)

<223> Chicken Lysozyme 3prime domain

<400> 10 

tgccgccttc tttgatattc actctgttgt atttcatctc ttcttgccga tgaaaggata 60 

taacagtctg tataacagtc tgtgaggaaa tacttggtat ttcttctgat cagtgttttt 120 

ataagtaatg ttgaatattg gataaggctg tgtgtccttt gtcttgggag acaaagccca 180 

cagcaggtgg tggttggggt ggtggcagct cagtgacagg agaggttttt ttgcctgttt 240

tttttttttt tttttttttt aagtaaggtg ttcttttttc ttagtaaatt ttctactgga 300

ctgtatgttt tgacaggtca gaaacatttc ttcaaaagaa gaaccttttg gaaactgtac 360

agcccttttc tttcattccc tttttgcttt ctgtgccaat gcctttggtt ctgattgcat 420

tatggaaaac gttgatcgga acttgaggtt tttatttata gtgtggcttg aaagcttgga 480

tagctgttgt tacacgagat accttattaa gtttaggcca gcttgatgct ttattttttc 540

cctttgaagt agtgagcgtt ctctggtttt tttcctttga aactggtgag gcttagattt 600

ttctaatggg attttttacc tgatgatcta gttgcatacc caaatgcttg taaatgtttt 660

cctagttaac atgttgataa cttcggattt acatgttgta tatacttgtc atctgtgttt 720

ctagtaaaaa tatatggcat ttatagaaat acgtaattcc tgatttcctt tttttttatc 780

tctatgctct gtgtgtacag gtcaaacaga cttcactcct atttttattt atagaatttt 840

atatgcagtc tgtcgttggt tcttgtgttg taaggataca gccttaaatt tcctagagcg 900

atgctcagta aggcgggttg tcacatgggt tcaaatgtaa aacgggcacg tttggctgct 960

gccttcccga gatccaggac actaaactgc ttctgcactg aggtataaat cgcttcagat 1020

cccagggaag tgcagatcca cgtgcatatt cttaaagaag aatgaatact ttctaaaata 1080

ttttggcata ggaagcaagc tgcatggatt tgtttgggac ttaaattatt ttggtaacgg 1140

agtgcatagg ttttaaacac agttgcagca tgctaacgag tcacagcgtt tatgcagaag 1200

tgatgcctgg atgcctgttg cagctgttta cggcactgcc ttgcagtgag cattgcagat 1260

aggggtgggg tgctttgtgt cgtgttccca cacgctgcca cacagccacc tcccggaaca 1320

catctcacct gctgggtact tttcaaacca tcttagcagt agtagatgag ttactatgaa 1380

acagagaagt tcctcagttg gatattctca tgggatgtct tttttcccat gttgggcaaa 1440

gtatgataaa gcatctctat ttgtaaatta tgcacttgtt agttcctgaa tcctttctat 1500

agcaccactt attgcagcag gtgtaggctc tggtgtggcc tgtgtctgtg cttcaatctt 1560

ttaaagcttc tttggaaata cactgacttg attgaagtct cttgaagata gtaaacagta 1620

cttacctttg atcccaatga aatcgagcat ttcagttgta aaagaattcc gcctattcat 1680

accatgtaat gtaattttac acccccagtg ctgacacttt ggaatatatt caagtaatag 1740

actttggcct caccctcttg tgtactgtat tttgtaatag aaaatatttt aaactgtgca 1800

tatgattatt acattatgaa agagacattc tgctgatctt caaatgtaag aaaatgagga 1860

gtgcgtgtgc ttttataaat acaagtgatt gcaaattagt gcaggtgtcc ttaaaaaaaa 1920

aaaaaaaaag taatataaaa aggaccaggt gttttacaag tgaaatacat tcctatttgg 1980

taaacagtta catttttatg aagattacca gcgctgctga ctttctaaac ataaggctgt 2040

attgtcttcc tgtaccattg catttcctca ttcccaattt gcacaaggat gtctgggtaa 2100

actattcaag aaatggcttt gaaatacagc atgggagctt gtctgagttg gaatgcagag 2160

ttgcactgca aaatgtcagg aaatggatgt ctctcagaat gcccaactcc aaaggatttt 2220

atatgtgtat atagtaagca gtttcctgat tccagcaggc caaagagtct gctgaatgtt 2280

gtgttgccgg agacctgtat ttctcaacaa ggtaagatgg tatcctagca actgcggatt 2340

ttaatacatt ttcagcagaa gtacttagtt aatctctacc tttagggatc gtttcatcat 2400

ttttagatgt tatacttgaa atactgcata acttttagct ttcatgggtt cctttttttc 2460

agcctttagg agactgttaa gcaatttgct gtccaacttt tgtgttggtc ttaaactgca 2520

atagtagttt accttgtatt gaagaaataa agaccatttt tatattaaaa aatacttttg 2580

tctgtcttca ttttgacttg tctgatatcc ttgcagtgcc cattatgtca gttctgtcag 2640

atattcagac atcaaaactt aacgtgagct cagtggagtt acagctgcgg ttttgatgct 2700

gttattattt ctgaaactag aaatgatgtt gtcttcatct gctcatcaaa cacttcatgc 2760

agagtgtaag gctagtgaga aatgcataca tttattgata cttttttaaa gtcaactttt 2820

tatcagattt ttttttcatt tggaaatata ttgttttcta gactgcatag cttctgaatc 2880

tgaaatgcag tctgattggc atgaagaagc acagcactct tcatcttact taaacttcat 2940

tttggaatga aggaagttaa gcaagggcac aggtccatga aatagagaca gtgcgctcag 3000

gagaaagtga acctggattt ctttggctag tgttctaaat ctgtagtgag gaaagtaaca 3060

cccgattcct tgaaagggct ccagctttaa tgcttccaaa ttgaaggtgg caggcaactt 3120

ggccactggt tatttactgc attatgtctc agtttcgcag ctaacctggc ttctccacta 3180

ttgagcatgg actatagcct ggcttcagag gccaggtgaa ggttgggatg ggtggaagga 3240

gtgctgggct gtggctgggg ggactgtggg gactccaagc tgagcttggg gtgggcagca 3300

cagggaaaag tgtgggtaac tatttttaag tactgtgttg caaacgtctc atctgcaaat 3360

acgtagggtg tgtactctcg aagattaaca gtgtgggttc agtaatatat ggatgaattc 3420

acagtggaag cattcaaggg tagatcatct aacgacacca gatcatcaag ctatgattgg 3480

aagcggtatc agaagagcga ggaaggtaag cagtcttcat atgttttccc tccacgtaaa 3540

gcagtctggg aaagtagcac cccttgagca gagacaagga aataattcag gagcatgtgc 3600

taggagaact ttcttgctga attctacttg caagagcttt gatgcctggc ttctggtgcc 3660

ttctgcagca cctgcaaggc ccagagcctg tggtgagctg gagggaaaga ttctgctcaa 3720

gtccaagctt cagcaggtca ttgtctttgc ttcttccccc agcactgtgc agcagagtgg 3780

aactgatgtc gaagcctcct gtccactacc tgttgctgca ggcagactgc tctcagaaaa 3840 

agagagctaa ctctatgcca tagtctgaag gtaaaatggg ttttaaaaaa gaaaacacaa 3900 

aggcaaaacc ggctgcccca tgagaagaaa gcagtggtaa acatggtaga aaaggtgcag 3960 

aagcccccag gcagtgtgac aggcccctcc tgccacctag aggcgggaac aagcttccct 4020 

gcctagggct ctgcccgcga agtgcgtgtt tctttggtgg gttttgtttg gcgtttggtt 4080 

ttgagattta gacacaaggg aagcctgaaa ggaggtgttg ggcactattt tggtttgtaa 4140 

agcctgtact tcaaatatat attttgtgag ggagtgtagc gaattggcca atttaaaata 4200 

aagttgcaag agattgaagg ctgagtagtt gagagggtaa cacgtttaat gagatcttct 4260 

gaaactactg cttctaaaca cttgtttgag tggtgagacc ttggataggt gagtgctctt 4320

gttacatgtc tgatgcactt gcttgtcctt ttccatccac atccatgcat tccacatcca 4380

cgcatttgtc acttatccca tatctgtcat atctgacata cctgtctctt cgtcacttgg 4440 

tcagaagaaa cagatgtgat aatccccagc cgccccaagt ttgagaagat ggcagttgct 4500 

tctttccctt tttcctgcta agtaaggatt ttctcctggc tttgacacct cacgaaatag 4560 

tcttcctgcc ttacattctg ggcattattt caaatatctt tggagtgcgc tgctctcaag 4620 

tttgtgtctt cctactctta gagtgaatgc tcttagagtg aaagagaagg aagagaagat 4680 

gttggccgca gttctctgat gaacacacct ctgaataatg gccaaaggtg ggtgggtttc 4740 

tctgaggaac gggcagcgtt tgcctctgaa agcaaggagc tctgcggagt tgcagttatt 4800 

ttgcaactga tggtggaact ggtgcttaaa gcagattccc taggttccct gctacttctt 4860 

ttccttcttg gcagtcagtt tatttctgac agacaaacag ccacccccac tgcaggctta 4920 

gaaagtatgt ggctctgcct gggtgtgtta cagctctgcc ctggtgaaag gggattaaaa 4980 

cgggcaccat tcatcccaaa caggatcctc attcatggat caagctgtaa ggaacttggg 5040 

ctccaacctc aaaacattaa ttggagtacg aatgtaatta aaactgcatt ctcgcattcc 5100 

taagtcattt agtctggact ctgcagcatg taggtcggca gctcccactt tctcaaagac 5160 

cactgatgga ggagtagtaa aaatggagac cgattcagaa caaccaacgg agtgttgccg 5220 

aagaaactga tggaaataat gcatgaattg tgtggtggac atttttttta aatacataaa 5280 

ctacttcaaa tgaggtcgga gaaggtcagt gttttattag cagccataaa accaggtgag 5340 

cgagtaccat ttttctctac aagaaaaacg attctgagct ctgcgtaagt ataagttctc 5400 

catagcggct gaagctcccc cctggctgcc tgccatctca gctggagtgc agtgccattt 5460

ccttggggtt tctctcacag cagtaatggg acaatacttc acaaaaattc tttcttttcc 5520

tgtcatgtgg gatccctact gtgccctcct ggttttacgt taccccctga ctgttccatt 5580

cagcggtttg gaaagagaaa aagaatttgg aaataaaaca tgtctacgtt atcacctcct 5640

ccagcatttt ggtttttaat tatgtcaata actggcttag atttggaaat gagagggggt 5700

tgggtgtatt accgaggaac aaaggaaggc ttatataaac tcaagtcttt tatttagaga 5760

actggcaagc tgtcaaaaac aaaaaggcct taccaccaaa ttaagtgaat agccgctata 5820

gccagcaggg ccagcacgag ggatggtgca ctgctggcac tatgccacgg cctgcttgtg 5880

actctgagag caactgcttt ggaaatgaca gcacttggtg caatttcctt tgtttcagaa 5940

tgcgtagagc gtgtgcttgg cgacagtttt tctagttagg ccacttcttt tttccttctc 6000

tcctcattct cctaagcatg tctccatgct ggtaatccca gtcaagtgaa cgttcaaaca 6060

atgaatccat cactgtagga ttctcgtggt gatcaaatct ttgtgtgagg tctataaaat 6120

atggaagctt atttattttt cgttcttcca tatcagtctt ctctatgaca attcacatcc 6180

accacagcaa attaaaggtg aaggaggctg gtgggatgaa gagggtcttc tagctttacg 6240

ttcttccttg caaggccaca ggaaaatgct gagagctgta gaatacagcc tggggtaaga 6300

agttcagtct cctgctggga cagctaaccg catcttataa ccccttctga gactcatctt 6360

aggaccaaat agggtctatc tggggttttt gttcctgctg ttcctcctgg aaggctatct 6420

cactatttca ctgctcccac ggttacaaac caaagataca gcctgaattt tttctaggcc 6480

acattacata aatttgacct ggtaccaata ttgttctcta tatagttatt tccttcccca 6540

ctgtgtttaa ccccttaagg cattcagaac aactagaatc atagaatggt ttggattgga 6600

aggggcctta aacatcatcc atttccaacc ctctgccatg ggctgcttgc cacccactgg 6660

ctcaggctgc ccagggcccc atccagcctg gccttgagca cctccaggga tggggcaccc 6720

acagcttctc tgggcagcct gtgccaacac ctcaccactc tctgggtaaa gaattctctt 6780

ttaacatcta atctaaatct cttctctttt agtttaaagc cattcctctt tttcccgttg 6840

ctatctgtcc aagaaatgtg tattggtctc cctcctgctt ataagcagga agtactggaa 6900

ggctgcagtg aggtctcccc acagccttct cttctccagg ctgaacaagc ccagctcctt 6960

cagcctgtct tcgtaggaga tcatcttagt ggccctcctc tggacccatt ccaacagttc 7020

cacggctttc ttgtggagcc ccaggtctgg atgcagtact tcagatgggg ccttacaaag 7080

gcagagcaga tggggacaat cgcttacccc tccctgctgg ctgcccctgt tttgatgcag 7140

cccagggtac tgttggcctt tcaggctccc agaccccttg ctgatttgtg tcaagctttt 7200

catccaccag aacccacgct tcctggttaa tacttctgcc ctcacttctg taagcttgtt 7260

tcaggagact tccattcttt aggacagact gtgttacacc tacctgccct attcttgcat 7320 

atatacattt cagttcatgt ttcctgtaac aggacagaat atgtattcct ctaacaaaaa 7380 

tacatgcaga attcctagtg ccatctcagt agggttttca tggcagtatt agcacatagt 7440 

caatttgctg caagtacctt ccaagctgcg gcctcccata aatcctgtat ttgggatcag 7500 

ttaccttttg gggtaagctt ttgtatctgc agagaccctg ggggttctga tgtgcttcag 7560 

ctctgctctg ttctgactgc accattttct agatcaccca gttgttcctg tacaacttcc 7620 

ttgtcctcca tcctttccca gcttgtatct ttgacaaata caggcctatt tttgtgtttg 7680 

cttcagcagc catttaattc ttcagtgtca tcttgttctg ttgatgccac tggaacagga 7740 

ttttcagcag tcttgcaaag aacatctagc tgaaaacttt ctgccattca atattcttac 7800 

cagttcttct tgtttgaggt gagccataaa ttactagaac ttcgtcactg acaagtttat 7860 

gcattttatt acttctatta tgtacttact ttgacataac acagacacgc acatattttg 7920 

ctgggatttc cacagtgtct ctgtgtcctt cacatggttt tactgtcata cttccgttat 7980 

aaccttggca atctgcccag ctgcccatca caagaaaaga gattcctttt ttattacttc 8040 

tcttcagcca ataaacaaaa tgtgagaagc ccaaacaaga acttgtgggg caggctgcca 8100 

tcaagggaga gacagctgaa gggttgtgta gctcaataga attaagaaat aataaagctg 8160 

tgtcagacag ttttgcctga tttatacagg cacgccccaa gccagagagg ctgtctgcca 8220 

aggccacctt gcagtccttg gtttgtaaga taagtcatag gtaacttttc tggtgaattg 8280 

cgtggagaat catgatggca gttcttgctg tttactatgg taagatgcta aaataggaga 8340 

cagcaaagta acacttgctg ctgtaggtgc tctgctatcc agacagcgat ggcactcgca 8400 

caccaagatg agggatgctc ccagctgacg gatgctgggg cagtaacagt gggtcccatg 8460 

ctgcctgctc attagcatca cctcagccct caccagccca tcagaaggat catcccaagc 8520 

tgaggaaagt tgctcatctt cttcacatca tcaaaccttt ggcctgactg atgcctcccg 8580

gatgcttaaa tgtggtcact gacatcttta tttttctatg atttcaagtc agaacctccg 8640

gatcaggagg gaacacatag tgggaatgta ccctcagctc caaggccaga tcttccttca 8700 

atgatcatgc atgctactta ggaaggtgtg tgtgtgtgaa tgtagaattg cctttgttat 8760 

tttttcttcc tgctgtcagg aacattttga ataccagaga aaaagaaaag tgctcttctt 8820 

ggcatgggag gagttgtcac acttgcaaaa taaaggatgc agtcccaaat gttcataatc 8880 

tcagggtctg aaggaggatc agaaactgtg tatacaattt caggcttctc tgaatgcagc 8940

ttttgaaagc tgttcctggc cgaggcagta ctagtcagaa ccctcggaaa caggaacaaa 9000

tgtcttcaag gtgcagcagg aggaaacacc ttgcccatca tgaaagtgaa taaccactgc 9060 

cgctgaagga atccagctcc tgtttgagca ggtgctgcac actcccacac tgaaacaaca 9120 

gttcattttt ataggacttc caggaaggat cttcttctta agcttcttaa ttatggtaca 9180 

tctccagttg gcagatgact atgactactg acaggagaat gaggaactag ctgggaatat 9240 

ttctgtttga ccaccatgga gtcacccatt tctttactgg tatttggaaa taataattct 9300 

gaattgcaaa gcaggagtta gcgaagatct tcatttcttc catgttggtg acagcacagt 9360 

tctggctatg aaagtctgct tacaaggaag aggataaaaa tcatagggat aataaatcta 9420 

agtttgaaga caatgaggtt ttagctgcat ttgacatgaa gaaattgaga cctctactgg 9480 

atagctatgg tatttacgtg tctttttgct tagttactta ttgaccccag ctgaggtcaa 9540

gtatgaactc aggtctctcg ggctactggc atggattgat tacatacaac tgtaatttta 9600

gcagtgattt agggtttatg agtacttttg cagtaaatca tagggttagt aatgttaatc 9660 

tcagggaaaa aaaaaaaaag ccaaccctga cagacatccc agctcaggtg gaaatcaagg 9720 

atcacagctc agtgcggtcc cagagaacac agggactctt ctcttaggac ctttatgtac 9780 

agggcctcaa gataactgat gttagtcaga agactttcca ttctggccac agttcagctg 9840

aggcaatcct ggaattttct ctccgctgca cagttccagt catcccagtt tgtacagttc 9900 

tggcactttt tgggtcaggc cgtgatccaa ggagcagaag ttccagctat ggtcagggag 9960 

tgcctgaccg tcccaactca ctgcactcaa acaaaggcga aaccacaaga gtggcttttg 10020 

ttgaaattgc agtgtggccc agaggggctg caccagtact ggattgacca cgaggcaaca 10080 

ttaatcctca gcaagtgcaa tttgcagcca ttaaattgaa ctaactgata ctacaatgca 10140 

atcagtatca acaagtggtt tggcttggaa gatggagtct aggggctcta caggagtagc 10200 

tactctctaa tggagttgca ttttgaagca ggacactgtg aaaagctggc ctcctaaaga 10260 

ggctgctaaa cattagggtc aattttccag tgcactttct gaagtgtctg cagttcccca 10320 

tgcaaagctg cccaaacata gcacttccaa ttgaatacaa ttatatgcag gcgtactgct 10380

tcttgccagc actgtccttc tcaaatgaac tcaacaaaca atttcaaagt ctagtagaaa 10440 

gtaacaagct ttgaatgtca ttaaaaagta tatctgcttt cagtagttca gcttatttat 10500 

gcccactaga aacatcttgt acaagctgaa cactggggct ccagattagt ggtaaaacct 10560 

actttataca atcatagaat catagaatgg cctgggttgg aagggacccc aaggatcatg 10620 

aagatccaac acccccgcca caggcagggc caccaacctc cagatctggt actagaccag 10680 
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gcagcccagg gctccatcca acctggccat gaacacctcc agggatggag catccacaac 10740 

ctctctgggc agcctgtgcc agcacctcac caccctctct gtgaagaact tttccctgac 10800 

atccaatcta agccttccct ccttgaggtt agatccactc ccccttgtgc tatcactgtc 10860 

tactcttgta aaaagttgat tctcctcctt tttggaaggt tgcaatgagg tctccttgca 10920 

gccttcttct cttctgcagg atgaacaagc ccagctccct cagcctgtct ttataggaga 10980 

ggtgctccag ccctctgatc atctttgtgg ccctcctctg gacccgctcc aagagctcca 11040

catctttcct gtactggggg ccccaggcct gaatgcagta ctccagatgg ggcctcaaaa 11100 

gagcagagta aagagggaca atcaccttcc tcaccctgct ggccagccct cttctgatgg 11160

agccctggat acaactggct ttctgagctg caacttctcc ttatcagttc cactattaaa 11220

acaggaacaa tacaacaggt gctgatggcc agtgcagagt ttttcacact tcttcatttc 11280 

ggtagatctt agatgaggaa cgttgaagtt gtgcttctgc gtgtgcttct tcctcctcaa 11340 

atactcctgc ctgatacctc accccacctg ccactgaatg gctccatggc cccctgcagc 11400

cagggccctg atgaacccgg cactgcttca gatgctgttt aatagcacag tatgaccaag 11460

ttgcacctat gaatacacaa acaatgtgtt gcatccttca gcacttgaga agaagagcca 11520 

aatttgcatt gtcaggaaat ggtttagtaa ttctgccaat taaaacttgt ttatctacca 11580 

tggctgtttt tatggctgtt agtagtggta cactgatgat gaacaatggc tatgcagtaa 11640 

aatcaagact gtagatattg caacagacta taaaattcct ctgtggctta gccaatgtgg 11700 

tacttcccac attgtataag aaatttggca agtttagagc aatgtttgaa gtgttgggaa 11760 

atttctgtat actcaagagg gcgtttttga caactgtaga acagaggaat caaaaggggg 11820 

tgggaggaag ttaaaagaag aggcaggtgc aagagagctt gcagtcccgc tgtgtgtacg 11880 

acactggcaa catgaggtct ttgctaatct tggtgctttg cttcctgccc ctggctgcct 11940 

tagggtgcga tctgcctcag acccacagcc tgggcagcag gaggaccctg atgctgctgg 12000 

ctcagatgag gagaatcagc ctgtttagct gcctgaagga taggcacgat tttggctttc 12060 

ctcaagagga gtttggcaac cagtttcaga aggctgagac catccctgtg ctgcacgaga 12120 

tgatccagca gatctttaac ctgtttagca ccaaggatag cagcgctgct tgggatgaga 12180

ccctgctgga taagttttac accgagctgt accagcagct gaacgatctg gaggcttgcg 12240

tgatccaggg cgtgggcgtg accgagaccc ctctgatgaa ggaggatagc atcctggctg 12300

tgaggaagta ctttcagagg atcaccctgt acctgaagga gaagaagtac agcccctgcg 12360

cttgggaagt cgtgagggct gagatcatga ggagctttag cctgagcacc aacctgcaag 12420

agagcttgag gtctaaggag taaaaagtct agagtcgggg cggcgcgtgg taggtggcgg 12480

ggggttccca ggagagcccc cagcgcggac ggcagcgccg tcactcaccg ctccgtctcc 12540 

ctccgcccag ggtcgcctgg cgcaaccgct gcaagggcac cgacgtccag gcgtggatca 12600 

gaggctgccg gctgtgagga gctgccgcgc ccggcccgcc cgctgcacag ccggccgctt 12660 

tgcgagcgcg acgctacccg cttggcagtt ttaaacgcat ccctcattaa aacgactata 12720 

cgcaaacgcc ttcccgtcgg tccgcgtctc tttccgccgc cagggcgaca ctcgcgggga 12780 

gggcgggaag ggggccgggc gggagcccgc ggccaaccgt cgccccgtga cggcaccgcc 12840 

ccgcccccgt gacgcggtgc gggcgccggg gccgtggggc tgagcgctgc ggcggggccg 12900 

ggccgggccg gggcgggagc tgagcgcggc gcggctgcgg gcggcgcccc ctccggtgca 12960 

atatgttcaa gagaatggct gagttcgggc ctgactccgg gggcagggtg aaggtgcggc 13020 

gcgggcggag ggacggggcg ggcgcggggc cgcccggcgg gtgccggggc ctctgccggc 13080 

ccgcccggct cgggctgctg cggcgcttac gggcgcgctt ctcgccgctg ccgcttctct 13140 

tctctcccgc gcaagggcgt caccatcgtg aagccggtag tgtacgggaa cgtggcgcgg 13200 

tacttcggga agaagaggga ggaggacggg cacacgcatc agtggacggt ttacgtgaag 13260 

ccctacagga acgaggtagg gcccgagcgc gtcggccgcc gttctcggag cgccggagcc 13320 

gtcagcgccg cgcctgggtg cgctgtggga cacagcgagc ttctctcgta ggacatgtcc 13380 

gcctacgtga aaaaaatcca gttcaagctg cacgagagct acgggaatcc tctccgaggt 13440 

gggtgttgcg tcggggggtt tgctccgctc ggtcccgctg aggctcgtcg ccctcatctt 13500 

tctttcgtgc cgcagtcgtt accaaaccgc cgtacgagat caccgaaacg ggctggggcg 13560 

aatttgaaat catcatcaag atatttttca ttgatccaaa cgagcgaccc gtaagtacgc 13620 

tcagcttctc gtagtgcttc ccccgtcctg gcggcccggg gctgggctgc tcgctgctgc 13680 

cggtcacagt cccgccagcc gcggagctga ctgagctccc tttcccggga cgtgtgctct 13740 

gtgttcggtc agcgaggcta tcgggagggc tttggctgca tttggcttct ctggcgctta 13800 

gcgcaggagc acgttgtgct acgcctgaac tacagctgtg agaaggccgt ggaaaccgct 13860 

ctcaaactga tttattggcg aaatggctct aaactaaatc gtctcctctc tttggaaatg 13920 

ctttagagaa ggtctctgtg gtagttctta tgcatctatc ctaaagcact tggccagaca 13980 

atttaaagac atcaagcagc atttatagca ggcacgttta ataacgaata ctgaatttaa 14040 

gtaactctgc tcacgttgta tgacgtttat tttcgtattc ctgaaagcca ttaaaatcct 14100 

gtgcagttgt ttagtaagaa cagctgccac tgttttgtat ctaggagata actggtgttt 14160 
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ccctacagtt ctcaagctga taaaactctg . tctttgtatc taggtaaccc tgtatcactt 14220 

gctgaagctt tttcagtctg acaccaatgc aatcctggga aagaaaactg tagtttctga 14280 

attctatgat gaaatggtat gaaaatttta atgtcaaccg agcctgactt tatttaaaaa 14340 

aaattattga tggtgctgtg tattttggtc cttccttaga tatttcaaga tcctactgcc 14400 

atgatgcagc aactgctaac gacgtcccgt cagctgacac ttggtgctta caagcatgaa 14460

acagagtgta agtgcaaaat gaggatacct tcgccgaccg tcattcacta ctaatgtttt 14520 

ctgtgggatg tgatcgtaca gtgagtttgg ctgtgtgaaa tttgaatagc ttggtattgg 14580 

cagtgatgac gtgatcgatg ccttgcttat catgtttgaa atgaagtaga ataaatgcag 14640 

cctgctttat ttgagatagt ttggttcatt ttatggaatg caagcaaaga ttatacttcc 14700 

tcactgaatt gcactgtcca aaggtgtgaa atgtgtgggg atctggagga ccgtgaccga 14760 

gggacattgg atcgctatct cccatttctt ttgctgttac cagttcagat tttcttttca 14820 

cctagtcttt aattcccagg gttttgtttt ttccttggtc atagtttttg tttttcactc 14880 

tggcaaatga tgttgtgaat tacactgctt cagccacaaa actgatggac tgaatgaggt 14940 

catcaaacaa acttttcttc ttccgtattt cctttttttt cccccactta tcatttttac 15000 

tgctgttgtt gagtctgtaa ggctaaaagt aactgttttg tgctttttca ggacgtgtgc 15060 

tttccaaatt actgccacat atataaagaa aggttggaat tttaaagata attcatgttt 15120 

cttcttcttt tttgccacca cagttgcaga tcttgaagta aaaaccaggg aaaagctgga 15180 

agctgccaaa aagaaaacca gttttgaaat tgctgagctt aaagaaaggt taaaagcaag 15240

tcgtgaaacc atcaactgct taaagagtga aatcagaaaa ctcgaagagg atgatcagtc 15300 

taaagatatg tgatgagtgt tgacttggca gggagcctat aatgagaatg aaaggacttc 15360 

agtcgtggag ttgtatgcgt tctctccaat tctgtaacgg agactgtatg aatttcattt 15420 

gcaaatcact gcagtgtgtg acaactgact ttttataaat ggcagaaaac aagaatgaat 15480 

gtatcctcat tttatagtta aaatctatgg gtatgtactg gtttatttca aggagaatgg 15540 

atcgtagaga cttggaggcc agattgctgc ttgtattgac tgcatttgag tggtgtagga 15600 

acattttgtc tatggtcccg tgttagttta cagaatgcca ctgttcactg ttttgttttg 15660 

tattttactt tttctactgc aacgtcaagg ttttaaaagt tgaaaataaa acatgcaggt 15720 

tttttttaaa tatttttttg tctctatcca gtttgggctt caagtattat tgttaacagc 15780 

aagtcctgat ttaagtcaga ggctgaagtg taatggtatt caagatgctt aagtctgttg 15840 

tcagcaaaac aaaagagaaa acttcataaa atcaggaagt tggcatttct aataacttct 15900 
ttatcaacag ataagagttt ctagccctgc atctactttc acttatgtag ttgatgcctt 15960 

tatattttgt gtgtttggat gcaggaagtg attcctactc tgttatgtag atattctatt 16020 

taacacttgt actctgctgt gcttagcctt tccccatgaa aattcagcgg ctgtaaatcc 16080 

ccctcttctt ttgtagcctc atacagatgg cagaccctca ggcttataaa ggcttgggca 16140 

tcttctttac tgctttgaga ttctgtgttg cagtaacctc tgccagagag gagaaaagcc 16200 

ccacaaacct catccccttc ttctatagca atcagtatta ctaatgcttt gagaacagag 16260 

cactggtttg aaacgtttga taattagcat ttaacatggc ttggtaaaga tgcagaactg 16320 

aaacagctgt gacagtatga actcagtatg gagacttcat taagacaaac agctgttaaa 16380 

atcaggcatg tttcattgag gaggacgggg caacttgcac cagtggtgcc cacacaaatc 16440 

cttcctggcg ctgcagacca atttttctgg cattctgact gccgttgctg ctggtcacag 16500 

agagcaacta tttttatcag ccacaggcaa tttgcttgta gtattttcca agtgttgtag 16560 

gtaagtataa atgcatcggc tccagagcac tttgagtata cttattaaaa acataaatga 16620 

aagacaaatt agctttgctt gggtgcacag aacattttta gttccagcct gctttttggt 16680 

agaagccctc ttctgaggct agaactgact ttgacaagta gagaaactgg caacggagct 16740 

attgctatcg aaggatcctt gttaacaaag ttaatcgtct tttaaggttt ggtttattca 16800 

ttaaatttgc ttttaagctg tagctgaaaa agaacgtgct gtcttccatg caccaggtgg 16860 

cagctctgtg caaagtgctc tctggtctca ccagcctttt aattgccggg attctggcac 16920 

gtctgagagg gctcagactg gcttcgtttg tttgaacagc gtgtactgct ttctgtagac 16980 

atggccggtt tctctcctgc agcttatgaa actgttcaca ctgaacacac tggaacaggt 17040 

tgcccaagga ggccgtggat gccccatccc tggaggcatt caaggccagg ctggatgtgg 17100 

ctctgggcag cctggtctgg tggttggcga tcctgcacat agcagcgggg ttgaaactcg 17160 

atgatcactg tggtcctttt caacccaggc tattctatga ttctatgatt caacagcaaa 17220 

tcatatgtac tgagagagga aacaaacaca agtgctactg tttgcaagtt ttgttcattt 17280 

ggtaaaagag tcaggtttta aaattcaaaa tctgtctggt tttggtgttt tttttttttt 17340 

atttattatt tctttggggt tctttttgat gctttatctt tctctgccag gactgtgtga 17400 

caatgggaac gaaaaagaac atgccaggca ctgtcctgga ttgcacacgc tggttgcact 17460 

cagtagcagg ctcagaactg ccagtctttc cacagtatta ctttctaaac ctaattttaa 17520 

tagcgttagt agacttccat cactgggcag tgcttagtga atgctctgtg tgaacgtttt 17580 

acttataagc atgttggaag ttttgatgtt cctggatgca gtagggaagg acagattagc 17640 

tatgtgaaaa gtagattctg agtatcgggg ttacaaaaag tatagaaacg atgagaaatt 17700 

cttgttgtaa ctaattggaa tttctttaag cgttcactta tgctacattc atagtatttc 17760 

catttaaaag taggaaaagg taaaacgtga aatcgtgtga ttttcggatg gaacaccgcc 17820 

ttcctatgca cctgaccaac ttccagagga aaagcctatt gaaagccgag attaagccac 17880 

caaaagaact catttgcatt ggaatatgta gtatttgccc tcttcctccc gggtaattac 17940 

tatactttat agggtgctta tatgttaaat gagtggctgg cactttttat tctcacagct 18000 

gtggggaatt ctgtcctcta ggacagaaac aattttaatc tgttccactg gtgactgctt 18060 

tgtcagcact tccacctgaa gagatcaata cactcttcaa tgtctagttc tgcaacactt 18120 

ggcaaacctc acatcttatt tcatactctc ttcatgccta tgcttattaa agcaataatc 18180 

tgggtaattt ttgttttaat cactgtcctg accccagtga tgaccgtgtc ccacctaaag 18240 

ctcaattcag gtcctgaatc tcttcaactc tctatagcta acatgaagaa tcttcaaaag 18300 

ctaggtctga gggacttaag gctaactgta gatgttgttg cctggtttct gtgctgaagg 18360 

ccgtgtagta gttagagcat tcaacctcta g 18391 

<210> 11 
<211> 586 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> MDOT artificial promoter 



<400> 11 
gtaccgggcc ccccctcgag gtgaatatcc aagaatgcag aactgcatgg aaagcagagc 60 
tgcaggcacg atggtgctga gccttagctg cttcctgctg ggagatgtgg atgcagagac 120 
gaatgaagga cctgtccctt actcccctca gcattctgtg ctatttaggg ttctaccaga 180 
gtccttaaga ggtttttttt ttttttggtc caaaagtctg tttgtttggt tttgaccact 240 
gagagcatgt gacacttgtc tcaagctatt aaccaagtgt ccagccaaaa tcgatgtcac 300 
aacttgggaa ttttccattt gaagcccctt gcaaaaacaa agagcacctt gcctgctcca 360 
gctcctggct gtgaagggtt ttggtgccaa agagtgaaag gcttcctaaa aatgggctga 420 
gccggggaag gggggcaact tgggggctat tgagaaacaa ggaaggacaa acagcgttag 480 
gtcattgctt ctgcaaacac agccagggct gctcctctat aaaaggggaa gaaagaggct 540 
ccgcagccat cacagaccca gaggggacgg tctgtgaatc aagctt 586 



<210> 12 




<211> 11 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> SV40 terminator 

<400> 12 

Cys Gly Gly Pro Lys Lys Lys Arg Lys Val Gly 
1 5 10 



<210> 13 

<211> 12 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer SaltoNotI 

<400> 13 
tcgagcggcc gc 12 



<210> 14 
<211> 83 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 14 
atggctttga cctttgcctt actggtggct ctcctggtgc tgagctgcaa gagcagctgc 60 
tctgtgggct gcgatctgcc tca 83 

<210> 15 
<211> 100 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 15 
gacccacagc ctgggcagca ggaggaccct gatgctgctg gctcagatga ggagaatcag 60 
cctgtttagc tgcctgaagg ataggcacga ttttggcttt 100 

<210> 16 
<211> 62 
<212> DNA 
<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 16 
ctcaagagga gtttggcaac cagtttcaga aggctgagac catccctgtg ctgcacgaga 60 
tg 62 

<210> 17 
<211> 94 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 17 
tccagcagat ctttaacctg tttagcacca aggatagcag cgctgcttgg gatgagaccc 60 
tgctggataa gttttacacc gagctgtacc agca 94 

<210> 18 
<211> 77 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 18 
ctgaacgatc tggaggcttg cgtgatccag ggcgtgggcg tgaccgagac ccctctgatg 60 
aaggaggata gcatcct 77 

<210> 19 
<211> 82 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 19 
gctgtgagga agtactttca gaggatcacc ctgtacctga aggagaagaa gtacagccct 60 
tgcgcttggg aagtcgtgag gg 82 

<210> 20 
<211> 65 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 20 
ctgagatcat gaggagcttt agcctgagca ccaacctgca agagagcttg aggtctaagg 60 
agtaa 65 

<210> 21 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 21 
cccaagcttt caccatggct ttgacctttg cctt 34 

<210> 22 

<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 22 
atctgcctca gacccacag 19 

<210> 23 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 



<400> 23 

gattttggct ttcctcaaga ggagtt 

<210> 24 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 
<400> 24 

gcacgagatg atccagcaga t 

<210> 25 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 25 

atcgttcagc tgctggtaca 

<210> 26 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 26 

cctcacagcc aggatgctat 

<210> 27 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 27 

atgatctcag ccctcacgac 

<210> 28 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 28 

ctgtgggtct gaggcagat 

<210> 29 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 
<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 29 

aactcctctt gaggaaagcc aaaatc 

<210> 30 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 30 

atctgctgga tcatctcgtg c 

<210> 31 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 31 

tgctctagac tttttactcc ttagacctca agctct 



<210> 32 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the synthesis of the MDOT promoter 
<400> 32 

tcactcgagg tgaatatcca agaat 

<210> 33 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the synthesis of the MDOT promoter 
<400> 33 

gagatcgatt ttggctggac acttg 

<210> 34 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 
<223> primer used in the synthesis of the MDOT promoter 
<400> 34 

cacatcgatg tcacaacttg ggaat 

<210> 35 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the synthesis of the MDOT promoter 
<400> 35 

tctaagcttc gtcacagacc gtccc 